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(57) Abstract 

Methods and apparauis for performing fluores- 
cence specuoscopy on a sample. A sample is irradi- 
ated with a fluorescence excitation fiber (30) and radi- 
ation is collected from the sample with a fluorescence 
collection fiber (60) and detected to form fluores- 
cence spectra. The sample is also illuminated with a 
reflectance illumination fiber and reflected light from 
the sample is collected at a plurality of collection 
positions and detected to form spatially resolved re- 
flectance spectra. The fibers may form a probe ar- 
ranged in concentric sections. The spectra are ana- 
lyzed by preprocessing and reducing the dimension- 
ality of the spectrsi data. 
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DESCRIPTION 

COMBINED FLUORESCENCE AND REFLECTANCE SPECTROSCOPY 
BACKGROUND OF THE INVENTION 

5 1. Field of the Invention 

The present invention relates generally to the fields of optical imaging. More 
particularly, it concerns apparatus and methods for combining fluorescence and 
refleaancc spectroscopy for the imaging of samples, including both in situ and ex situ 
imagining of body tissues. 

10 

2. Description of Related Art 

Cancer is one of the leading causes of death in the United States and in the 
world. In the United States alone, deaths frbtn cancer are estimated to number 
560.000 in 1997 (American Cancer Society Online, Cancer Facts & Figures). 

IS Currently, diagnosis and treatment of cancer^'folidw histopathologic evaluation of 
directed biopsies. However, the tissue removal necessitated by these techniques not 
only may alter the progression of the disease (Robbins and Kumar, 1984) but is also 
very costly. Improving the capability for in situ monitoring of disease progression 
could greatly enhance the ability to detect and U^ai cancer and precancer (Kelloff et 

20 a/., 1992). 

A growing number of clinical studies have demonstrated that fluorescence 
spectroscopy may be used to distinguish normal and abnormal human tissues in vivo 
in the skin, head and neck, genito-urinary tract, gastro-intestinal tract, breast, and 
brain. It is weU known that fluorescence intensity and lineshape are a function of both 

25 the excitation and emission wavelength in samples containing multiple chromophorcs, 
such as human tissue. A complete charactcriKUion of the fluorescence properties of 
an unknown sample requires measurement of a .fluorescence excitation emission 
matrix, in which the fluorescence intensity is recorded as a function of both excitation 
and emission wavelength. The field of analytical chemistry has exploited the 

30 fluorescence properties of different compounds, to identify and quantify them in 
mixtures. 
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Most clinical studies reported to date have measured fluorescence emission 
spectra at only a small number of excitation wavelengths (lypicaJly one to three) due 
to clinical requirements imposed on the size, speed and sensitivity of instrumentation. 
The choice of excitation wavelength has been based on factors which vary from study 
5 to study, but include laser availability and predictions of chromophores thought to be 
present in normal and abnormal tissues and measurements of fluorescence excitation 
emission matrices (EEM) of normal and abnormal tissues m vitro. While in vitro 
measurements of tissue EEMs are feasible using commercially available scaiming 
fluorimeters, several studies have demonstrated ^ that the optical properties of tissue 

10 change significantly when tissue is examined in vitro due in part to interruption of the 
blood supply, oxidation and small size of biopsies. Thus, in vitro smdies to select 
excitation wavelengths are of limited value. 

Several recent studies have suggested that differences in optical properties, 
assessed using diffuse reflectance spectroscopy, may be used, to discriminate normal 

15 and abnormal human tissues in vivo in the urinary bladder and the skin. Furthermore, 
measuring both fluorescence and diffuse reflectance, spectra may provide additional 
information of diagnostic value. 

A system capable of measuring spatially , resolved reflectance spectra and 
fluorescence excitation emission matrices in vivo, .would remove limitadons of many 

20 previous studies, potentially enabling prediction of excitation wavelengths that 
provide greatest discrimination of normal and abnormal tissues, as well as a better 
understanding of the relative diagnostic ability of changes in absorption, scanering 
and fluorescence properties of tissue. Although fiber optic systems to record 
fluorescence EEMs and reflectance spectra at a single spatial location have been 

25 reported, such systems have measured data from only a single spatial location, and 
have thus not been able to perform spatially resolved spectroscopy. Additionally, 
previous systems have not been well-adapted for in-vivo studies of various tissues, 

SUMMARY OF THE INVENTION 

30 

In one respect, the invention is an apparanis for performing fluorescence and 
spatially resolved reflectance spectroscopy on a sample, and it includes a light source, 
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a monochromator. a reflectance illumination fiber, a fluorescence excitation fiber, an 
imaging spectrograph, a fluorescence collection fiber, a reflectance collection fiber, 
and a detector. The monochromator is in optical communication with the light source. 
The reflectance illumination fiber is in optical communication with the light source. 
5 The fluorescence excitation fiber is in optical conmiunicaiion with the 
monochromator. The fluorescence collection fiber is in optical communication with 
the unaging spectrograph. The reflectance coUcction fiber is in optical 
communication with the imaging spectrograph, and is in spaced relation with the 
reflectance iUmnination fiber. The detector isv in optical communication with the 

1 0 imaging spectrograph. 

In other aspects, the light source may include a Xe arc lamp. The 
monochromator may include a double monochromator. The detector comprises a 
thcnno-electrically cooled CCD camera. The fluorescence excitation fiber and the 
fluorescence collection fiber may be integral;: .One or more of the fibers may be 

15 positioned flush with the sample. The apparatus: may also include a spacer positioned 
between one or more of the fibers and the sample. The reflectance illumination fiber, 
the fluorescence excitation fiber, die fluorescence collection fiber, and die reflectance 
collection fiber may define a fiber optic probe. The probe may be configured to be 
positioned wiUiin a trocar. The probe may include a center section and an outer 

20 section, and the fluorescence excitation fiber and the fluorescence collection fiber may 
be positioned in the center section, and the reflectance illumination fiber and tiie 
reflectance collection fiber may be positioned in tiic outer section. The apparams may 
include a plurality of fluorescence excitation and coUection fibers arranged in a 
circular bundle. The apparatus may include a plurality of reflectance collection fibers 

25 defining a plurality of collection positions. The. plurality of collection positions may 
be spaced between about 0 and about 10 millimeters from the reflectance illumination 
fiber. The reflectance collection fiber may define a collection position at about 180 
degrees relative to the reflectance illumination fiber. The reflectance collection fiber 
may define a collection position at about 90 degrees relative to tiie reflectance 

30 illumination fiber. The reflectance collection fiber may define a collection position at 
about 45 degrees relative to tiie reflectance illumination fiber. The apparatus may 
include one or more fibers in optical conununication witfi the light source and 
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configured to iUuminate the sample during operation of the apparatus. The apparatus 
may include a plurality of fluorescence excitaUon fibers arranged in one or more rows 
adjacent the monochromator. The apparatus may include a plurality of fluorescence 
excitation fibers and a plurality of reflectance collection fibers arranged in a single 
row adjacent the imaging spectrograph. The apparatus may include one or more 
unconnected fiben interspersed with the pluraUty of fluorescence excitation fibers and 
the plurality of reflectance collection fibers. Tim appums may include a fiber 
connected from the light source to the imaging spectrograph to monitor spectral 
output of the light source. The apparatus may include a controller coupled to the 
detector. 

In another respect, the invention is an apparams for measuring fluorescence 
and spatially resolved reflectance spectra of a sample. The apparams includes a light 
source, a monochromator, a fiber optic probe, an imaging spectrograph, and a 
detector. The monochromator is in optical communication with the light source. The 
fiber optic probe is in optical conomunication, with the light source and with the 
monochromator. The probe includes a plurality of . fluorescence excitation and 
coUecdon fibers in spaced relation and a pluraliQ? :of reflectance collecdon fibers in 
spaced relation with a reflectance illumination fiber. The imaging spectrograph is in 
optical communication with the plurality of fluorescence collection fibers and with the 
plurality of reflectance collection fibers. The detector is in optical communication 
with the imaging spectrograph. 

hi otiier aspects, the plurality of reflectance collection fibers and die 
reflectance illumination fiber may be positioned concentrically about die plurality of 
fluorescence excitation and collection fibers.. . At least one of the pluraUty of 
reflectance coUection fiben may define a collection position at about 180 degrees 
relative to the reflectance illumination fiber. At least one of the plurality of 
reflectance collection fibers may define a collection position at about 90 degrees 
relative to tbc reflectance iUumination fiber.: At least one of the plurality of 
reflectance collection fibers may define a colhiction position at about 45 degrees 
relative to die reflectance illumination fiber. The plurality of collection positions may 
be spaced between about 0 and about 10 millimeters from die reflectance illumination 
fiber. The probe may include between twenty-one and forty-six optical fibers. 
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In another respcci, the invention is a method for combined fluorescence and 
spatially resolved reflectance spectroscopy of a sample. The method includes 
directing radiation to tiie sample witii a fluorescence excitation fiber, collecting 
radiation from the sample with a fluorescence collection fiber, directing the radiation 
from tiie sample to an imaging spcaiogtaph and a detector, illuminating the sample 
with a reflectance illumination fiber, collecting reflected light from the sample with a 
reflectance collection fiber in spaced relation, with tiie reflectance illumination fiber, 
and directing the reflected Ught from die sample to an imaging spectrograph and a 
detector. 

In otiier aspects, die step of collecting reflected light may include collecting 
reflected light from a plurality of collection positions widi a plurality of reflectance 
collection fibers. The step of coUecting reflected light may include coUecting 
reflected light from die sample witii a reflectance coUection fiber defining a coUection 
position at about 180 degrees relative to tiie reflectance illumination fiber. The step of 
coUecting reflected light may include coUecting reflected light from die sample widi a 
reflectance coUection fiber defining a coUection position at about 90 degrees relative 
to tiie reflectance Ulumination fiber. The step of coUecting reflected Ught may include 
collecting reflected light fit)m die sample witii a reflectance collection fiber defining a 
collection position at about 45 degrees relative to the reflectance iUumination fiber. 
The sample may include ovarian, head and neck, or cervical tissue. The metiiod may 
also include analyzing spectral data from the detector to characterize tiie sample. The 
step of analyzing may include pre-processing Uie data and reducing a dimension of die 
data using principal component analysis. The step of analyzing may also include 
selecting one or more diagnostic principal components of tiie data and forming one or 
more algoritiims. The step of analyzing may also include fonning one or more 
composite algoritiims. The step of analyzmg may also include evaluating at least on 
of die algorithms using a cross-vaUdation technique. 

In another respect, die invention is a method for combined fluorescence and 
spatiaUy resolved reflectance specttoscopy of a sample. The metiiod includes 
directing radiation to die sample witii a fluorescence excitation fiber, collecting 
radiation from die sample widi a fluorescence collection fiber, directing die radiation 
from die sample to an imaging spectrograph and a detector, illuminating die sample 
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witli a reflectance Ulumination fiber, coDccting reflected light at a plurality of 
collection positions from the sample with a plurality of reflectance collection fibers 
airangcd in spaced relation, directing die reflected light from the sample to an imaging 
spectrograph and a detector to produce spectral data, pre-processing the data, and 
reducing a dimension of the dau using principal icomponent analysis. 

The method may also include selecting one or more diagnostic principal 
components of the data and forming one of more aljgbritimis. The mediod may also 
include forming one or more composite algbriithms. The metiiod may also include 
evaluating at least one of Uie algorithms using a wbss-vaUdation technique. 

In anotiier respect, die invention is a method for analyzing spectroscopy data to 
define an optimized reduced data set. The method includes pre-processing die 
spectroscopy data, reducing a dimension of die spectroscopy data using principal 
component analysis, and selecting one or more diagnostic principal components of the 
spectroscopy data. 

In other aspects, the spectroscopy dau may include combined fluorescence and 
spatiaUy resolved reflectance spectroscopy data. Jhe step of pre-processing may 
include normalization of die spectroscopy data, The step of pre-processing may 
include mean scaling die spectroscopy data. The . step of pre-processing may include 
calculating one or more derivatives on Uie spectroscopy data. The method may also 
include eliminating redundant data from die spectroscopy data. The metiiod may also 
include forming one or more algoritiims and evaluating at least one of the algoridims 
using a cross validation technique. The metiiod. may also include forming one or more 
composite algorithms. 

AppUcations for the metiiods and apparatus described herein are vast and 
include, but are not limited to, analysis and detection of disease including cancers and 
pre-cancers (such as cervical, bead and neck, colon, lung, esophageal, ovarian) and 
adierosclerosis. AppUcations also include industry, including, but not limited to. tiie 
semiconductor industry. 
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BRIEF DESCRIPTIQN OF THE DRAWINGS 

The foUowing drawings form pan of the present specificaaon and are included 
to further demonstrate certain aspects of the present invention. The invenuon may be 
better understood by reference to one or more of these drawings in combinauon with 
the detailed description of specific embodiments presented herein. 

FIG. 1 Block diagram of a Fast EEM system according to one embodiment of 
the present disclosure. 

FIGS. 2A and 2B Probe output at 332 nm according to one embodiment of 
the present disclosure. 

FIG. 3 Inside of a light source according to one embodiment of the present 
disclosure. 

nG. 4 Outside connectors of the light Source according to one embodiment 
of the present disclosure. 

FIGs. SA and SB Comparison between the monochromator and the spectral 
lamp output 

FIGs. 6A and 6B A probe according to the present disclosure showing 
fluorescence excitation fibers, fluorescence collection fibers, a quartz rod, a 
reflectance excitation fiber, and reflectance collection fibers. 

FIG. 7 Probe according to the present disclosure showing fluorescence fibers, 
a quartz rod, reflectance fibers, illumination fibers, a protection shield, and a quartz 
shield. 

FIGS. 8A and 8C Tip of a probe according to the present disclosure showing 
illumination of i) reflectance ii) fluorescence and iii) illumination fibers. 

FIGS. 9A and 9B Monochromator and spectrograph connector with 
fluorescence and reflectance coUection fiben according to one embodiment of the 
present disclosure. 

FIG. 10 Probe including fiber connectors according to one embodiment of the 
present disclosure. Shown are visual iUumination fiber 113. reflectance excitation 
fiber 1 15, fluorescence excitation fiber 1 17, and reflectance coUection position 1 19. 

FIG. 11 Correction faaors for the specttvgraph. " 
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EIG. 12 Schematic of Binning techniques: On chip binning (left). On chip 
and software binning (right). 

FIG. 13 Main screen of a Fast-EEM user interface according to one 
embodiment of the present disclosure. 

FIG. 14 System block diagram showing a variable excitation light source, a 
fiber optic delivery and collection probe, and a spectral, multichannel analyzer 
according to one embodiment of the present disclosure. 

FIGS. ISA - 15D (left) Schematic diagram of the distal ends of the probe: 
[a] outer shaft, [b] fluorescence excitation and emission fibers, [c] reflectance 
collection and illumination fibers, [d] mixing element, [E] reflectance excitation fiber, 
[1-3] reflectance collection locations, (Right) Schematic diagram of the proximal ends 
of the probe, 

nG. 16A Simulated EEM with peak shifting in [1] excitation wavelength [2] 
and emission wavelength. ; . ■ 

FIGS. 16B-1 to 16B-6 Simulated EEM with peak shifting in [1] excitation 
wavelength [2] and emission wavelength. Calfculated Xav and may for the simulated 
EEM. X.V is sensitive to changes in the excitation position of the peak and mav is 
sensitive to the emission position. 

FIG. 17A EEM of Rhodamine standard solution. 

FIG. 17B EEM of an FAD and microspheres-based tissue phantom measiued 
using a FastEEM system. 

FIGS. 18A - 18D (A) Emission spectra at 360 nm excitation of the 
Rhodamine calibration standard measured with the FastEEM system and SPEX 
Huorolog n fluorimeter. (B) Emission spectra at 360 nm excitation of the scattering 
tissue phanthom containing FAD and polystyrene microspheres measured with the 
FastEEM system and SPEX Fluorolog D fluorimeter. (Q Emission spectra at 450 nm 
excitation of the Rhodamine calibration standard measured with the FastEEM system 
and SPEX Fluorolog n fluorimeter. (D) Emission spectra at 450 nm excitation of the 
scattering tissue phanthom containing FAD and polystyrene microspheres nieasured 
with the FastEEM system and SPEX Fluorolog U fluorimeter. 

FIGS. 19A and 19B In-vivo fluorescence measurements with the FastEEM 
system: (A) Fluorescence EEM of a normal site of the tongue. (B) Fluorescence EEM 
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of a diseased site of the tongue, containing a moderately differentiated squamous ceil 
carcinoma. 

FIGS. 20A - 20C Fluorescence emission spectra of normal and moderately 
differentiated squamous cell carcinoma of the tongue from Figure 6. The spectra were 
normalized to the peak fluorescence at 350 nm excitation, (a) Fluorescence emission 
spectra at 350 nm excitation, (b) Fluorescence emission spectra at 410 nm excitation, 
(c) Fluorescence emission spectra at 460 nm excitkion. 

FIGS. 21A and 21B Emission and excitation autocorrelation vectors of 
normal and moderately differentiated squamous cell carcinoma of the tongue from 
FIGS. 18. (A) Emission autocorrelation vectors. (B) Excitation autocorrelation vectors. 

FIGS. 22A • 22C. Reflectance measuremetits of normal and moderately 
differentiated squamous cell carcinoma of die tongue at three different separations 
from the source fiber. (A) Position 1. 1.1 mm separation. (B) Position 2. 2.1 mm 
separation. (C) Position 3, 3 mm separation. 

FIGs. 23A - 23C A schematic of tiie portable fluorimeter used to measure 
cervical tissue fluorescence spectra at three excitation wavelengths. 

FIG. 24 A schematic of formal analytical process used to develop the 
screening and diagnostic algoritimis. The text, in the dashed-line boxes represent 
mathematical steps implemented on the specual data and the text in the solid line 
boxes represent outputs after each mathematical step (NS - normal squamous^ NC - 
normal columnar. LG - LG SIL and HG - HG SIL).. 

FIGS. 2SA - 2SC (a) Original and corresponding (b) normalized and (c) 
normalized, mean-scaled spectra at 337 nm excitation from a typical patient. 

FIGS. 26A - 26C (a) Original and corresponding (b) normalized and (c) 
normalized, mean-scaled spectra at 380 nm excitation from the same patient 

FIGS. 27A 27C (a) Original and corresponding (b) normalized and (c) 
normalized, mean-scaled spectra at 460 nm excitation from tiie same patient. 

FIG. 28 A plot of Uie posterior probability of belonging to the SIL category of 
all SILs and normal squamous epitiielia from tiie calibration set Evaluation of tiie 
misclassified SILs indicates tiiat one samples with CIN III, two witii GIN H, two witii 
GIN I and two witij HPV are incorrectiy classified. * 
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FIG. 29 A plot of the posterior probability of belonging to the SIL category of 
all SILs and normal columnar cpithelia from the calibration data set. Evaluation of the 
misdassified SELs indicates that three samples with CIN n. three with CIN I and one 
with HPV are incorrectly classified. 

FIG. 30 A plot of the posterior probability of belonging to the HG SIL 
category of all SILs from the calibration set Evaluation of the misclassified HG SILs 
indicates that three samples with CIN m and three with CIN arc incorrecUy classified 
as LG SILs: five samples with CIN I and two with HPV are misclassified as HG SIL. 

FIGS. 31A - 31C Component loadings (CL) of diagnostic principal 
components of constituent algorithm (1), obtained from nonnalized spectra at (a) 337 
(b) 380 and (c) 460 nm excitation, respectively. 

FIGS. 32A - 32C Component loadings (CL) of diagnostic principal 
components of constiment algorithm (2), obtained from nornialized, mean-scaled 
spectra at (a) 337 (b) 380 and (c) 460 nm excitation, respectively. 

FIGS. 33A - 33C Component loadings (CL) of diagnostic principal 
components of constiment algorithm (3), obtained from nonnalized spectra at (a) 337 
(b) 380 and (c) 460 nm excitation, respectively. 

FIGS. 34A • 34D Plots of Frequency of occurrence vs. emission wavelength 
in top 25 perfomiing combinations of three wavelengths: (a) ESL=65%, (b) 
ESU75%. (c) ESU:85%, and (d) ESL=95% 

FIG. 35 Fluorescence emission spectra normalized by the peak intensity of 
the concatenated vector for all 62 sites at 350. 380 and 400 nm excitation. Red lines 
mdicate histologically cancerous, green lines indicate histologicaUy dysplastic, and 
blue lines indicate visually and/or histologically normal sites. 

FIG. 36 Plot of the only eigenvector of diagnostic importance at ESL = 65% 
for wavelengtii combination (350 380 400) (lower line at vector index=200) and the 
corresponding component loading (upper line at vector jndex=200). 

FIG. 37 Plot of emission vector for a wavelengUi combination of tbnc 
excitation wavelengths (350, 380. 400 nm) normalized by the peak intensity of each 
emission spectra. 

nCS. 38A • 38C Reflectance spectra (A), first (B) and second derivation (C) 
for position one. 
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FIGS. 39A - 39C Reflectance spectra (lop), fiist (middle) and second 
derivation (bottom) for position two. 

nCS. 40A - 40C Reflectance spectra (top), first (middle) and second 
derivation (bottom) for position three. 

nCS. 41A - 41C Average reflectance spectra (top), first (middle) and second 
derivation (bottom) for position one. Error bars show standard deviation. 

FIGS. 42A - 42C Average reflectance spectra (top), first (middle) and second 
derivation (bottom) for position two. Error bars show standard deviation. 

nCS. 43A - 43C Average reflectance spectra (top), first (middle) and second 
derivation (bottom) for position three. Error bars show standard deviaUon. 

nCS. 44A - 44C p values comparing the mean intensity, mean first and 
second derivatives of normal tissue versus abnormal tissues, at source detector 
separation 1 (top), 2 (middle) and 3 (bottom). 

HGS. 45A - 45C p values comparing the mean intensity, mean first and 
second derivatives of normal tissue versus dysplastic tissues, at source detector 
separation 1 (top), 2 (middle) and 3 (bottom). 

FIG. 46 Scatter plot of the second derivative at 430 nm for position 2 vs. die 
second derivative at 495 nm for position one. The straight line represents an algorithm 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
80% and a specificity of 85%. 

FIG. 47 Scatter plot of Uie second derivative at 45.0 nm for position 1 vs. the 
first derivative at 510 nm for position three. The straight line represents an algorithm 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
80% and a specificity of 82%. 

FIG. 48 Scatter plot of the second derivati.ve at 410 nm for position 1 vs. die 
first derivative at 510 nm for position tiiree. The straight line represents an algoritiun 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
70% and a specificity of 75%. 

DESCRIPTIO N OF ILLUSTRATIVE KMUnnTMiTisrrg 

FIG. 1 shows one embodiment of an ^aranis 10 according to the present 
disclosure. The apparattis is adapted to measure boUi reflectance and fluorescence 
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data, and may be refeired to as a Fast-EEM system (where EEM stands for excitation 
emission matrix) system. Fast EEM system 10; in one embodiment, may include four 
main components, although those having skiU in the an wiU recognize that more or 
fewer components may be utlized: The compbnrats are: (a) an excitation source 20. 
which may include an are lamp 22 and a monochromator 24 for monochromauc and 
broad band excitation, (b) a fiber optic probe 30. which may be configured to deliver 
excitation light to and coUect remitted fluorescence from a sample 60, (c) a detection 
apparanis 40. which may include a filter wheel* an unaging spectrograph 42, and a 
CCD camera 44 and that spectrally resolves a collected signal, and (d) a control unit 
50. which may be a personal computer used to ran Fast EEM system 10 and to acquire 
data. 

Excitation source 20 

The light source 22 for Fast EEM system 10. which may provide both quasi- 
monochromatic excitation for fluorescence/ W broad band iUumination for 
renectance. may be. in one embodiment, a 150 W ozone free Xe arc lamp (Spectral 
Energy Corp., Westwood NJ) with a spherical rear reflector. 

A condenser system including two piano convex quartz lenses may be used to 
couple Ught into a monochromator 24. With the beiiefit of the present disclosure, 
those having skiU in the art will understand diat any optical filter or device suitable for 
creating bandpass filtered light may be used for monochromator 24. In one 
embodiment, monochromator 24 may be a single monochromator. A manual shutter 
(not shown) may be located between condensing optics and monochromator 24 and 
may be closed to prevent fluorescence excitaUon light from reaching sample 60 during 
reflectance measurements. The scanning speed of monochromator 24 may be. in one 
embodiment, about 10 nm/sec. Ught may be coupled from the output slit of 
monochromator 24 into probe 30 via a fiber optic adapter (Spectral Energy. GMA 
257) (not shown) that includes a quartz plano-convex lens and a 5X quartz 
microscope objective. The light passing through the objective may be focused to an 
appropriate shape to fiU one or more fibers of probe 30. In one embodiment, light 
passing through tiie objeaive may be focused onto a vertical Unc onto twenty-five 
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fibers of probe 30. the twenty-five fibers being arranged in two columns and placed at 
the focal plane of the objective (See FIG. 9A). 

A reflectance excitation fiber (See. e.g.. HG. 6) may be coupled to the lamp 
housing of light source 22 via a micropositioner (not shown). Broadband light exiting 
the lamp housing through an exiting hole may be coupled to a reflectarice illumination 
fiber using a quartz plano-convex lens (NA=0.24). A five position illumination filter 
wheel (not shown) placed between the lamp and the lens may include three long pass 
filters with 50% transmission at 295 nm, 515 nm aind 715 nm. respectively. One of 
the filter positions may be blocked and may act as a shutter to prevent white light from 
reaching sample 60 during fluorescence measurements. 

In another embodiment, the light source 22 for Fast EEM system 10. which 
may provide both quasi-monochromatic excitation for fluorescence and broad band 
illumination for reflectance, may be an ozone-free 450 W Xe arc lamp (FH007. 
Instruments S A, Edison, NJ). 

light used for monochromatic fluorescence excitation may be focused with a 
spherical mirror (not shown) onto the input slit: of monochromator 24. In this 
embodiment, monochromator 24 may be a . double inonochromator (DDD 180, 
Instruments SA, Edison, NJ). A spherical rear reflector (not shown) may redirect light 
that is exiting the lamp in the opposite direction into the opposite direction onto the 
spherical mirror. The sUt may be covered with a sapphire window, which may 
prevent hot air from flowing out of the lamp housing into the monochromator 24. A 
double monochromator may be chosen for monochromator 24 because of its higher 
stray light rejection compared to a single monochromator. A double monochromator 
may be configured in additive mode, which means that the dispersions of the two 
holographic gratings ate added. Stray light in such a configuration may be so slight as 
to be negligible. The focal length of each of the two monochromators may be about 
18 cm and the high throughput may be f/3.9. The two holographic gratings may have 
about 1200 grooves/imn and may be blazed at SOOnm. In this embodiment, the 
system's maximal resolution may be about 0.3 nm with an accuracy of about 0.5 nm. 
The scanning speed in this cmboidiment may be about 150nm/s, and the usable 
wavelength range may be from about 300 to about 1000 nm. Wavelength scanning 
may be achieved with a direct digital stepper-motor with a worm drive mechanism 
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(not 5hown). Three computer-controlled slits (entrance, middle, and exit) may be 
opened between 0 and 7 mm in steps of 12.5 jun. In one embodiment, a slit-width of 
about 2 mm may be chosen for both the entrance and the exit slits. The middle slit 
twice may be opened as wide as the entrance and the exit slit to achieve an optimal 
performance. These settings guaranteed a spectral resolution of about 6 nm FWHM. 

FIG. 2 shows a spectrum taken at 332 nm by coupling light through probe 30 
through a fiber optic ad^ter into a scanning spectrofluorimeter (SPEX, Ruoiolog n, 
Edison, NJ). An emission scan from 300 nm to 600 nm was performed to collect the 
relative intensity of the probe output 

In one embodiment, the coupling of light into a fluorescence excitation bundle 
(See. e.g.. FIG. 6 and HG. 7) was done using a fiber-optic interface kit (220F, 
Instruments SA, Edison. NJ). Two plano-convex lenses (different focal lengths) may 
be matched to different NAs of the exit slit and of a fiber bundle of probe 30 to 
minimize coupling losses. A computer-controlled :shutter (LS6, Vincent Associates. 
Rochester, NY) may be mounted in front of the probe connector to block fluorescence 
excitation light during reflectance measurements, ■-. .. 

Light source 22 may be customized to prpyide white light output White light 
may be needed (a) for reflectance measurements,, (b) for visual observation of a 
meastirement site by a physician, and (c) to monitor the lamp output. 

FIG, 3 shows a top view drawing of the inside of the lamp housing according 
to one embodiment Light bulb 25 and ray traces (dashed lines) for the 
monochromator light are shown. In one embodiment the optimal solution to provide 
white light output to the outside of the housing involved the use a bundle of quartz 
fibers. One biconvex lens, mounted in a custom-made rack inside the lamp housing, 
coupled light into a bundle of three 600 ^m and one 50 jim high-tempcraoirc quartz 
fibers (Thermocoat Hberguide Industries, Stirling, NJ). The light rays are indicated 
by the dotted line in FIG. 3. These fibers transported white light to four connectors on 
the outside of the housing (See FIG. 4). The first connector CI may provide 
excitation light used for reflectance measurements. The five-position illumination 
filter wheel described previously may be placed between two biconvex quanz lenses 
(focal length = 20 mm). The second connector C2 may be equipped with one quartz 
lens (focal length = 20 mm) that focuses light onto the illumination fiber bundle. A 
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second shutter (LS6. Vincent Associates. Rochester, NY) may be placed between the 
connector and the lens, which may be closed during data acquisition and may 
otherwise be held open to deliver light to the iUumination fibers of probe 30. The 
third 600 nm fiber output C3 may be used for other purposes, or not at all. The 50 ^m 
fiber output C4 may couple Ught into a fiber diat is duectly connected to imaging 
spectTOgrafih 42 to record the lamp spectrum for every measurement. In one 
embodiment, however, this option is not used. 

FIG. 5 illustrates the power output of two monochromatic illummation 
systems (one using a 150 W ozone fiee Xe arc lamp and the other using an ozonc-&ee 
450 W Xe arc lamp). The output was measured through probe 30 using a calibrated 
power meter (818-UV. Newport. Irvine, CA) and represents the flux (W) that is 
provided to sample 60, which may be a tissue sample. Above about 400 nm, an 
improvement in power of a factor of four is noticeable. Note that the lamp performed 
poorly below 400 nm. The light output at about 3a0 nm is only about 20% of the peak 
performance at 460 nm. The low UV output niaiy be due to. the fact that lamp is an 
ozone-free model. The Ught bulb is made out of UV blocking glass since Ozone is 
mainly produced in the surrounding air within this; spectral region. In order to have a 
useful S/N ratio prolonged exposure times in the>pcctral region below 400 nm may 
become a necessity. 

Probe 30 

The combined spatial reflectance and fluorescence probe 30 of the present 
disclosure may be built to meet the following criteria. First, the tissue volume probed 
by the reflectance and fluorescence measurements may overlap. Second, because the 
collected fluorescence intensity may be typically three orders of magnitude lower than 
the reflectance intensity, a detector with a high dynamic range may be required. 
Weakening the reflectance excitation light by using a smaller excitation fiber or using 
a number of fluorescence excitation fibers may, however, alleviate this problem. 
Third, the total diameter of the probe may be sniaU enough so that it is possible to 
cover an area of only one tissue type; for example, dysplastic lesions around a wmor 
are likely to be only a few millimeters wide. Fmally. a probe 30 small in diameter 
may give the opportunity to use it for minimal invasive surgeries through uocars. 
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According to one embodiment, probe 30 may fit into a trocar. In one embodiment, it 
is designed to fit into a trocar (Refiex SIR, 5 mm. Richard-Allan Inc.) that is 
commonly used in the Gynecology Department at The Univenity of Texas M. D. 
Anderson Cancer Center. Houston, TX, (UT MDACC). 

One embodiment of a combined reflectance and fluorescence probe 30 
includes a total of 21 quartz fibers (200 pm core diameter, NA=:0.22). With the 
benefit of 4e present disclosure, however, those of skill in die art wiU recognize that 
more or fewer fibers may be used. Additonally, although the present disclosure refers 
to embodiments of a probe including 'Tibers", it will be understood that any channel 
suitable for transmission of light may be substituted therewith. In one embodiment, a 
ring of twelve fluorescence collection fibers 70 surround a circle of seven 
fluorescence excitation fibers 72. In one embodiment (not shown), at least one 
fluorescence fiber may be an integral fluorescence excitation and coUection fiber. At 
the distal end of fluorescence excitation and collection, fibers may be a quartz rod 
(about 1.5 mm diameter, about 7 mm thick) 74 located to ensure an overlap at the 
sample surface between fluorescence excitation and coUection fibers. One reflectance 
excitation fiber 76 and one reflectance coUection fiber 78 (both about 90 jmi core 
diameter) may be placed outside of die quartz rod and flush to the sample, which may 
be tissue, on opposite sides. The reflectance fibers may be about 1.7 mm apart from 
each other, and light may be scattered Uirough. the. same tissue volume that is 
examined for fluorescence. 

in one embodiment, a probe 30 may have a total lengtii of about 28 cm to 
about 35 cm, which allows die probe to pass a trocar shaft. Widi the benefit of tbt 
present disclosure, however, those having skiU in. the art wiU recognize diat die probe 
30. and otiier components described herein, may be made of different size (and 
materials) according to need or desire. 

Turning to FIG. 7 and HG. 8. it may be seen diat the diagnostic portion of 
probe 30 may include forty-six optical fibers (about 200 ^m, NA=0.22) in two 
concentric sections. WiUi Uie benefit of die present disclosure, however, diose of skiU 
in die art wUl recognize tiiat more or fewer fibers may be used. The center bundle 80 
(Sec FIG. 7) aiay contain twenty-five fluorescence excit^on fibers and twelve 
fluorescence coUection fibers. At die distal end of die probe 30. diese fibers may be 
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airanged randomly io central bundle 80 and may be placed in mechanical contact with 
a shon piece (about 1^ cm long) of thick quaitz fiber 82. Ught sent through this rod 
may be distributed over an examined area. The rod's length may be determined by the 
radius of the rod and the NA of the fibers and may be calculated by taking twice the 
radius and dividing it by the fiber NA. 

Nine fibers for illumination and collection of diffuse reflectance may be 
arranged in a ring around the fluorescence fibers (See element 84, HG. 7). Three 
coUection fibers 86 may be located at about 180°, two fibere 88 and 90 may be located 
at about 90', and two fibers 92 and 98 may be located at about 45° from the 
iUufflination fiber 94. A single coUection fiber 96 may be placed direcUy beside die 
reflectance excitation fiber in to measure single backscattered light. Fibers 92 and 98 
may have a distance to the excitation source of about 1.4 mm, fibers 88 and 90 of 
about 2.4 mm, and fibers 86 of about 3.3 mm. The distal ends of die reflectance 
fibers may be flush with the tip of die central , fiberand placed in contact with the 
sample surface. 

For measurements tiiat take longer than , about 30 s, an optical feedback 
mechanism for die probe operator may need to be provided to avoid a displacement of 
die insttument Therefore, a third ring of seven fibers 100, witfi an offset of about 2 
cm (for a 28 cm probe) and about 5 cm (for a 35.cm probe) from die tip may be added 
for illumination purposes. Probe 30 may have a screw-on protection shield 102 at the 
tip of die probe. Specularly reflected light between a quartz shield 104 and die probe 
30, however, may lead to an uncorrectable biasing of die probe performance, and 
Uierefore protection shield 102 may optionally not be used. A 30-minute soaking of 
probe 30 in a disinfecting solution like Cidex™ (Johnson and Johnson Inc.) allows die 
probe to be used in die sterile environment of an operating room. 

The arrangement of fibers at die monochromator 24 and die spectrograph 42 
connectors, according to one embodiment, are shown in HG. 9 The fluorescence 
excitation fibers 108 may be arranged in two rows for optimally fdling by a 
rectangular output beam of die monochromator 24. The fibers on die spectrograph 42 
end may be lined up in a single row, as shown. Fibers 1 10 arc fluorescence coUection 
fibers, and fibers 1 12 (represented by darkened circles) are die reflectance coUection 
fibers. Because saturation in one fiber location may bloom to adjacent pixels on die 
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deieciofi additional spacing, realized by unconnected fibers (illusirated by un- 
darkened circles), reduced this problem. In this embodiment, the spectrograph 
connector contains fiber 114 that may be connected directly to a white light output of 
light source 22, which may be a Xe lamp, to monitor the spectral output of the light 
source over time. 

FIG. 10 illustrates an entire probe 30. according to one embodiment, including 
connectors and connecting fibers. Note that reflectance collection fiber 94 (See FIG. 
8), die position right next to the excitation fiber, inay be intemipted by disconnecting 
SMA connector #2. This feature was created in this embodiment in case the direcUy 
backscattered light signal was too strong and needed attenuation. 

Spectrograph 42 and filter wheel 

Imaging spectrograph 42, in one embodiment, may be a commercial imaging 
spectrograph (Chromex 250 IS, Albuquerque. MM). A grating of about 100 
groove<5/mm, blazed at about 450 nm may be used. With Uie benefit of the present 
disclosure, however, tiiose of skill in the art will understand tiiat any optical filter or 
device suitable for analyzing spectral content of light from one or muUiple sources 
simultaneously may be used for imaging spectrograph 42. 

Light collected by fluorescence and reflectance fibers and die excitation light 
guided direcdy from the source may be coupled tiirough an 8-position, computer 
controlled coUection filter wheel (Optomechanics Research^ Inc., Vail, AZ). into 
imaging spectrograph 42. The filter wheel blocks die fluorescence excitation light 
from entering die spectrograph 42. The spectrograph may contain a holographic 
grating blazed at about 380 nm widi about 100 grooves/mm. The fibers may be 
projected onto an entrance slit (about 250 jim) to yield a spectral resolution of about 
7 nm. 

The non-uniform spectral response of the system may be corrected as shown in 
HG. 11. These correction factors may be determined from measurements of 
calibration sources; in the visible, a N.LS.T traceable tungsten ribbon filament lamp, 
and in die UV, a deuterium lamp may be used (550C and 45D, Optionic Laboratories 
Inc.. Orlando, FL). 
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Variations in the intensity of fluorescence excitation light source at different 
exciution wavelengths may be corrected using measurements of the intensity at each 
excitation wavelength at the probe tip using a calibrated photodiode (818-UV, 
Newport). 

CCD Camera 44 

A thenno-electrically cooled CCD camera 44 (Specu^urce HPC- 1 , WesUaJce 
Village, CA) may be operated at about -30" C and may be located at the back focal 
plane of the imaging spectrograph 42. Chip dimensions may be about 13.8 x 9.2 mm 
with 1536 x 1024 pixels (Kodak KAF-1600 grade 2), to yield a nominal spectral range 
of about 410 nm for a single grating position. Each fiber may take up about 40 pixels. 
The dark current of the CCD chip, in this embodiment, was specified and confirmed 
as 0.25 electrons/pixeysec when operated at -30° C. Quanttun efficiency of die 
lumogen-coated chip may range from a peak of about 40% at about 550 nm to a low 
of about 15% at about 250 nm. 

Binning Pixels 

The HPC-1 CCD camera 44 allows a user to perform on-chip binning of 
pixels. Binning means that neighboring pixels may be added together to represent 
only one data point. This feature is attractive for at least two reasons: (1) it allows a 
reduction in the time required to read data from the chip, and (2) it increases the 
signal-to-noise ratio by reducing tiie effective read out arid shot noise. 

Although a useful feature, excessive binning may diminish tiie resolution of 
the system. FurUiennore, because the full well capacity of tiie pixels and shift register 
is limited, it is possible to exceed Uiis capacity by either grouping too many pixels 
togeUier or by encountering an unexpectedly strong signal (blooming). When 
blooming occurs, charge in excess of die fiill well capacity of a capacitive element 
may spill into adjacent pixels. This can essentially fill the pixels witii charge and 
render diem unavailable for signal detection or perhaps give a false indication of 
signal where none exists. 

In one embodiment, binning was only electronically implemented in tiie spatial 
direction on tiie chip. The 12 fluorescence excitation fibers filled 480 pixels and were 
all binned togeOier. For die reflectance excitation, a combined binning in hardware 
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and- software was used in one embodiment This technique had two advantages: (I) it 
increased the dynamic range compared to a fiiil binning in hardware, and (2) it 
increased the data transfer rates as compared to non-binned data. FIG. 12 shows the 
two different binning techniques. 
5 In one embodiment, the camera 44 and the readout electronics did not operate 

in a reliable manner. Long-term testing showed that counts on every pixel can vary 
from exposure to exposure when the shutter remains closed. A DC offset variation on 
the chip, resulting m an average count of 700/pixel/s to 1500/pixel/s was monitored 
during a 12 hour period. The origin of this behavior was expected to be either a 

10 cooling problem of the CCD camera 42 or an unstable DC offset supplied to the A/D 
convener. In an attempt to cure at least some of the problems, a higher number of 
pixels were digitized that were acnially physically present The count of these fake 
pixels reflected the DC offset of the signal and was found to be independent of the 
detector temperature. Testing showed that the count of these fake pixels varied the 

15 same way as the real pixels did. In this embodiment, monitoring of the background 
was required at every single measurement, since a low fluorescence signal may lie in 
this range. The background could be subtracted from the acquired data. In the 
embodiment, another problem was discoveredi with the readout of the chip. The first 
electronically binned line that was read out was always corrupted and had to be 

20 discharged. This meant that the double amount of pixels were binned into two 
columns from which the first corrupted one was diunped. 

Software and Control 

In one embodiment. National Instruments Labview Version 3.0 (Austin, TX) a 
graphical programming development environment based on the G (Graphic) 

25 programming language may be used to control Fast EEM system 10. The platform for 
the control software may be any suitable control device or computer 50. In one 
embodiment, a laptop 486/75 MHz personal computer with docking station (Austin 
Inc., Austin, TX) was used as computer 50. Communication with the excitation 
monochromator may be provided via an RS-232 control module tiiat is interfaced to 

30 the COM port of the docking station of computer 50. A camera control card may be 
mounted in the docking station. The imaging spectrograph 42 may be operated using 
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a National Instniments GPIB IEEE-488 board that is also located inside the docking 
station of computer SO. 

In another embodiment, a desktop computer was chosen (Optiplex 233GXa, 
Dell Computer Corporation, Round Rock, TX) equipped with a Windows95™ 
operating system as computer 50. All mentioned cards in this embodiment may be 
connected to the ISA-bus of computer 50. A double monochromator 24 and 
spectrogr^h 42 controls may be connected by a GPIB IEEE-488 interface (AT- 
GPffi/TNT , National Instruments. Austin, TX): The two shutters and the filter wheel 
may be controlled with a digital I/O card (PC-PIO.24, National Instruments, Austin. 
TX). The CCD camera 44 may have its own ISA-bus interface card. The readout rate 
of die chip in this embodiment was greater than about 65.000 pixels/s. This gave a 
readout time of about 24 s for the whole chip if no binning was used, hi this 
embodiment, no on board RAM was available to buffer acquired data. 

hi one embodiment. Ubview V.5.0 (National . Instruments, Austin, TX) was 
chosen as the software to control the entire Fast EEM system 10. In this embodiment, 
the goal of software development was to create an easy to use interface that made the 
system controllable by an operator with basic computer knowledge after only a few 
days of training. . . 

Such software may be designed using a small number of basic sub-Vi's (Vi: 
virtual instrument. National Instniments* expression for software units). Operator 
interaction may be minimized to avoid human errors. Automation of file saving and 
auto-naming of saved files may be implemented; to prevent loss of data by mislabeling 
or accidentally overwriting certain files. Such, automation may also speed up the 
interaction time of an operator with die software .between measurements. 

In one embodiment, stored fluorescence data was loaded immediately after 
storage and could be visually inspected in tiie center of the screen. Such a routine may 
be added as a quality-ensuring feature, and it may also help to prevent data loss caused 
by saving errors or misalignment of Uie system if die operator was experienced in 
interpreting the acqtuied data. 
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Software Structure 

FIG. 13 shows a main user interface according lo one embodimait from which 
the Fast. EEM system 10 may be controlled. With the benefit of the presem 
disclosure, those having skill in the art will understand that there are numerous ways 
in which system 10 may be controlled and that the interface shown in FIG. 13 is but 
only one of those ways. Other user interfaces may be implemented as is known in the 
art In FIG. 13, the center displays show four spectra of the last fluorescence 
measurement (top graph) and the acquired reflectance data (bottom). The excitation 
wavelengths of the displayed spectra may be changed online. Around this screen, 
different buttons may be present, which allow access to the certain main feamres. 

h the configuration component of the software interface illustrated in FIG. 13, 
aU the configuraUons were accessible and controllable. In the 'Saving parameter" sub 
program, a patient numbw and the directory path may be defined. The integration 
time for the individual exposures and the settings of the CCD camera 44 may be 
stored in the corresponding subroutine. The Spectrograph settings may be changed in 
the *Chromex*-Vi. The buttons for the mercury calibration, the lamp monitoring, and 
the power output of the probe may also be associated with the configuration settings 
of the software. 

In regard to acquiring date, individual switches for starting the background and 
the standards measurements may be placed on the left side of the spectra display. The 
fluorescence, reflectance and combined reflectaice and fluorescence measurements 
may be initiated in the 'Main Measurements' box.. Naming of files with the acquired 
data may be dependent on which kind of measurenient is chosen. In one embodiment, 
no manual naming of files by the operator was necessary. 

Many additional feamres may be added to the software and user interface. For 
example, an image of the whole CCD chip with all possible settings and binnings may 
be achieved. The monochromator 24 may be moved to any desired wavelength. The 
center wavelength of the spectrograph 42 may be set manually, too. The camera's 44 
exposure time may be adjusted, and it may possible to choose if the shutter of the 
spectrogr^h 42 should open or if it should remain closed to image the dark current. 
Anodier sub Vi may be designed to change all the settings of the monochiomator 24, 
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such.as wavelength, and slit width. Emission and reflectance spectra may be loaded 
and visuaUy compared on the screen. It may be possible to turn on and off the probe's 
iUumination light from the main screen. It shall be understood that none of these extra 
feanires need influence the settings for the main measurements. Default values may 
always be restored when measurements are started. When exiting the software, a 
protocol file may be created that contains all the hnportant settings, the date, file 
names and the name of the operator. In one embodmient, about 1 12 individual Vi's 
were created to design a reliable, easy-to-use and fault-pioof system, although it wUl 
be understood that more or fewer routines may be implemented according to the needs 
or desires of the user. In other embodiments, for instance, a simpler or more 
complicated user interface may be easily implemented as is known in the art. 
Temporal Performance 

Table 2.1 compares the temporal performance of the two embodiments of Fast 
EEM systems described above - one utilizing a 150 W ozone free Xe are lamp, single 
monochromator. and twenty-one fiber probe (Embodiment A); and the other system 
using a 450 W ozone free Xe arc lamp, a double monochromator. and a forty-sU fiber 
probe (Embodiment B). Overall, the time to obtain a complete EEM in Embodiment 
B between 330 nm and 500 nm excitation in steps of 10 nm was cut down to less than 
45 s, a temporal improvement of 105 seconds over Embodiment A. To obtain the 
same amount of counts on the CCD chip, the exposure times may be cut down fiom 
1500 ms to 200 ms, depending on the excitation wavelength. An exposure time of 
375 ms may be expected since the amount of light delivered to the tissue may increase 
by a factor of 4. The alignment on the emission side was improved in Embodiment B, 
so that the tiiroughput was ahnost twice as much as before. The monochromator's 
scanning speed may be decreased from 34 s for an entire scan and resetting to die 
starting wavelengUi to less than 3 s. A faster computer and the use of a 32-bit 
operating system in Embodiment B cut down the computation time by almost 50%. 
However, u still required about 2 s per exposure to transfer the data from die camera 
to die computer. This value adds up to 42 s, 75% of the whole data acquisition time. 
This handicap may be ftiither improved by replacing die readout electronics of die 
CCD chip. The control of die illumination shutter, a new feature of die system, did 
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not add any extra time to the measurements. The shutter opened and closed in less 
thanSms. 

In embodiment B, reflectance measurements may be sped up by using a 
200 Mm fiber for the excitation Ught instead of a SOjun fiber, since more light is 
5 provided to the sample 60, which may be a tissue. A niore intense white Ught output 
of the system may serve the same purpose. By using a different imaging spectrograph 
42 with a grating with lower spectral dispersion, a wider spectmm may be covered on 
the CCD chip. To cov« the desired spectral range for reflectance measurements, only 
two (instead of three) sub-range exposures may be necessary. Overall data acquisition 
10 time over 2 wavelength ranges and four positions may be achieved in 31 s in 
Embodiment B. which is about three times faster than that in Embodiment A, in 
which only 3 spatial positions had to be exposed 
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Table 2.1 Comparison of Temporal Performance: 



- 


Embodiment A 


Embodiment B 


Fluorescence 






Scanning time: 

2 x 500 nin-330nm 


2x 170nm/I0nm/s=34s 


2 X 70 nm/ ISO nm/s =2.7 s 


Exposure time 


18xl.5s = 27s 


20 exposures: 

E = 6.0 s (see 3.1.2) 


Moving filter wlieel 


8x 1 s = 8s 


8xls=8s 


Camera shutter, data 

transpon 


18x4.5s = 81s 


21x2s = 42s 


illumination shutter 




<ls 




Z=l50s 


Z = 53.7 s 


Reflectance 






Exposure time 


9 exposures: 27 s 


8 exposures: 6 s 


Camera shutter, data 
transport 


63 s 


25s 




Z=90s 


Z=31s 



In summary, a combined reflectance and fluorescence measurement with the 
Embodiment B may be obtained in 85 s, about three times faster than with the 
Embodiment A. This temporal improvement may benefit the patient and may also 
minimize the chance that the physician moves the probe during measurements. 

The following examples are included to demonstrate preferred embodiments 
of the invention. It should be appreciated by those of skill in the art that the 
techniques disclosed in the examples which follow represent techniques discovered by 
the inventor to function well in the practice of the invention, and thus can be 
considered to consutute preferred modes for its practice. However, those of skill in 
the art should, in light of the present disclosure, appreciate that many changes can be 
made in the specific embodiments which arc disclosed and still obtain a like or similar 
result without departing from the spirit and scope of the invention. 
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EXAMPLE 1 

Fluorescence Excitation Emission Matrices of Human Tissue: A System for In vivo 
Measurement and Method of Data Analysis 

This example describes a Fast EEM system capable of measuring spatially 
resolved reflectance spectra from 380-950 mn and fluorescence excitation emission 
matrices from 330-500 nm excitation and 380-700 nm emission in vivo. System 
performance was compared to a standard scanning.spectrofluoriineter. This FastEEM 
system was used to interrogate human normal and neoplastic oral cavity mucosa in 
vivo. Measurements were made through a fiber optic probe and required about 4 
minutes total measurement time. This example also presents a method based on 
autocorrelation vectors to identify excitation and emission wavelengths where the 
spectra of normal and padjologic tissues differ most The FastEEM system provides a 
tool with which to study the relative diagnostic abUity of changes in absorption, 
scattering and fluorescence properties of samples, including tissue samples. 
Materials and Methods: 

FIG. 14 iUustrates a block diagram of a Fast EEM system 10 in accordance 
with tiie present disclosure. This system includes at least three main components: (1) 
an arc lamp 22, stepper motor driven monochromator 24 and filter wheel, which 
provides monochromatic and broad band excitation. (2) a fiber optic probe 30 which 
directs excitation light to die sample 60. which rftay be a tissue sample, and collects 
remitted fluorescence from, in tiiis embodiment, one location and diffusely reflected 
light from, in diis embodiment, three locations, and (5) a filter wheel, imaging 
spectrograph 42 and CCD camera 44 which detects the spectrally resolved reflectance 
and fluorescence signals. Excitation monochromator position, filter wheel position, 
spectrograph grating position, CCD operation and data acquisition are controlled 
using a laptop personal computer 50 mated to a docking station. The specifications of 
each sub-system are described below. 

The probe 30. illustrated in FIG. 15, included a total of forty-six optical fibers 
(200 ^m diameter, NA=0.2) arranged in two concentric bundles. The center bundle 
contained twenty-five fluorescence excitation fibers and twelve fluorescence 
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collection fibers. The proximal ends of the fiuorescence excitation fibers were 
ananged in two vertical lines at the exit sUt of the excitaUon monochromator 24 to 
maximize the coupling of the Ught into the sample. The proximal ends of the 
fluorescence collecUon fibers were arranged in a single vertical Une at the entrance slit 
of the imaging spectrograph 42. At the distal end of the probe 30, the fibers that 
excite and collect fluorescence were arranged randomly in a central bundle and placed 
in contact with a short piece of a thick quartz fiber (2 mm diameter, 15 mm long, 
NA=0.2). The distal tip of this fiber was placed in contact widi the sample surface 60. 
and ensured that the area from which fluoresceiice Was collected was the same as that 
directly iUuminated. 

The nine fibers for illumination and collection of diffuse reflectance were 
arranged in a concentric ring around the thick quartz fluorescence measurement fiber. 
The distal ends of these fibers were flush with the tip of the central fiber and were 
placed in contact with the sample surface 60. White light from a port on the side of 
the lamp housing was coupled to the proximal end of a single illumination fiber (80 
Mm. NA 0.2). Photons that scatter thn)ugh the. tissue and exit Uie surface were 
coUected at four different positions with seven collection fibers; three located 180° 
from the iUumination fiber (3 mm distance), two located 90' from the illumination 
fiber (2.1 mm) and two located 45* from the illumination fiber as shown (1.1 mm) 
(See HG. 15). The proximal ends of tiie reflectance collection fibers were situated at 
the top of tile vertical line of fluorescence collection .fibers, separated by dummy 
fibers as shown in FIG. 15. 

The light source 22 for Uie instrument, which provided botii quasi- 
monochromatic excitation for fluorescence and broad band illumination for 
reflectance, was a 150 W ozone free Xe arc lamp (Spectral Energy Corp., Westwood 
NJ) witii a spherical rear reflector. A condenser^ system consisting of two plano- 
convex quartz lenses was used to couple light into monochromator 24. The primary 
condenser was 1 J inches in diameter witij an aperture ratio of ^1.5. The secondary 
condenser was also 1.5 inches in diameter, but was masked to pn)vide numerical 
aperture matching to tiie monochromator 24. A manual shutter was located between 
die condensing optics and monochromator 24 and was closed to prevent fluorescence 
excitation light from reaching die sample 60 during reflectance measurements. The 
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monochromator 24 had ao aperture ratio of f/3.6 (Spectral Energy, GM 252) and was 
used with an ion-etched holographic grating (ISA, Edison. NJ. 240 nm blaze. 1180 
grooves/mm, dispersion = 3.3 nm/mm). An RS-232 controlled stepper motor drove 
the monochromator 24 widi a maximum stepping rate of about 400 step/sec (about 10 
nm/sec). A bandwidth of 6.6 nm was selected by setting the entrance slit of the 
monochromator to about 2.0 mm. Light was cpupled from the monochromator 24 
into the probe 30 via a fiber optic adapter (Spectral Energy, GMA 257) consisting of a 
quartz plano-convex lens and a 5X quartz microscope objective. The Ught passing 
through the objective was focused onto a vertical line of 25 fibers in two columns, 
placed at the focal plane of the objective (See FIG. 15). The reflectance excitation 
fiber was attached to die lamp housing via a microposiUoner. Broadband light exiting 
the lamp housing through an existing hole was coupled to Uie reflectance illumination 
fiber using a quartz plano<onvex lens (NA=0.24). A five position iUumination filter 
wheel placed between Uie lamp and the lens contained Mirce long pass filters witii 50% 
transmission at 295 nm. 515 nm and 715 nm. One of the. filter positions was blocked 
and acted as a shutter to prevent white light from reaching the sample during 
fluorescence measurements. 

Light collected by fluorescence and reflectance fibers was coupled through an 
8 position, computer controlled collection filter wheel, into a Chromex 250 IS 
(Albuquerque. NM) imaging spectrograph 42 containing a holographic grating blazed 
at 380 nm witii 150 grooves/mm and a reciprocal linear dispersion (RID) of 20 
nm/mm. The fibers were projected onto an entrance slit (250 nm) which yielded a 
spectral resolution of about 5 nm. A tiierroo-electrically cooled CCD camera 44 
operated at about -30' C (Spectrasouice HPC-1, WesUak^ Village. CA) was located at 
tiw back focal plane of the imaging spectrograph 42. Chip dimensions were 13.8 x 
92 mm wiUi 1536 x 1024 pixels (Kodak KAF-J600 grade 2). yielding a nommal 
spectral range of about 276 nm for a single grating position. Dark current was 
specified as 0.25 electrons/pixel/sec when operated at -30" C. The quantum efficiency 
of the lumogen coated chip ranged from a peak of 40% at 550 nm to a low of 15% at 
250 nm. 

The detector and imaging spectrograph . were wavelengtii calibrated by 
measuring Uie room light spectra tiiat showed Uuee Mercury, peaks at 404.7, 436 and 
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546 nm. The relatioa between pixels and wavelength was then lineariy fitted through 
these points. 

Fluorescence and reflectance measurements were obtained sequentially. Prior 
to fluorescence measurements, the white light port was closed and pixels iUuminated 
by the fluorescence fibers were selected to be read from the CCD 44. Dark current 
and A/D conversion offset was measured with the same setting as the subsequent 
measurement but with a closed camera shutter. These were subtracted from all 
Huorescence and reflectance measurements. The first excitation wavelength was 
selected by scanning the excitation monochromator, the emission filter wheel was 
rotated to select the appropriate long pass filter and the spectrograph grating was 
adjusted to record signal over the desired emission wavelength range. The 
monochromator 24 and camera shuners were then opened for the desired exposure 
time to record the fluorescence emission spectrum (1.5 seconds). The excitation 
wavelength was then incremented, and the process repeated untU all desired excitation 
wavelengths have been measured. The excitation wavelengths were incremented from 
330 to 500 nm in 10 nm steps. Table 1 contain5: .a list of the exciution wavelengths 
and corresponding long pass fUteis and emission wavelength ranges used in this 
Example. 

FoUowing collection of fluorescence spectra, diffuse reflectance spectfa were 
then measured. For these measurements, the monochromator shutter was closed, the 
emission filter wheel was set to the lowest filter position and the pixels illuminated by 
the corresponding reflectance coUection fibers were selected to be read from the CCD 
44. Dark current and A/D conversion offsets were measured and stored for subtraction 
of the following measurements. The reflectance spectrum was coUected over three 
mumination wavelength ranges. Prior to measurement of each range, the appropriate 
long-pass filter was selected in the illuminaUon filter wheel, and the spectrograph 
grating was adjusted to record signal over the desired wavelength range. The lamp and 
camera shutters were then opened for the desired exposure time to record the 
reflectance spectrum (0.4 - 4.8 seconds). The illumination wavelength range was then 
incremented, and the process repeated until all desired wavelength ranges have been 
measured. Exposure times were determined empirically to achieve a signal to noise 
ratio greater than 20. Table 1 contains a list of the illumination wavelength ranges 
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and corresponding long pass fUiers used for diffuse reflectance measurements. The 
high dynamic range of the reflectance measurements, spanning over three orders of 
magnitude, required that each spatial position be read out individually from the CCD 
44. This prevented saturation and blooming artifacts. 

There are no accepted safety standards for illinnination of mucosal surfaces 
other than skin and cornea. However, the exposure of solar radiation that is 
equivalent to the exposure received when a measurement is made with this system has 
been calculated. The method compares the spectral inadiance [W/cm" nmj of the 
excitation source with solar inadiance data obtained from [NSF Polar Programs UV 
Spectroradiometer Network 1994-1995 Operations Report; NSF UV Radiation 
Monitoring Network 1994 to 1995 Volume 5.0 Data Set. Available at 
WWW.BI0SPHERICAL.COM.]. The comparison includes a point-wise division of 
the inadiance from the FastEEM system to the solar inadiance at the same 
wavelength. This ratio gives a relative solar exposure factor. The solar data is for a 
sunny day in San Diego. California. Irradiation during fluorescence excitation is less 
than 7 times solar exposure at all wavelengths,; Given that fluorescence excitation 
times were 1.5 seconds, this corresponds to exp^^ure to solar radiation for less than 1 1 
seconds in any given wavelength band. During diffuse reflectance measurements, the 
lamp exposure is maximum at 300 nm, where the relative exposure is a factor of 25 
that of the sun. Since the total exposure time for this wavelength band is 14 seconds, 
the exposure corresponds to 350 seconds or less than 6 minutes. All other 
wavelengths have relative exposure factors of 10 or less resulting in a shorter 
equivalent total solar exposure. 

Prior to every patient measurement the . probe output was measured with a 
calibrated power meter (Newport, Irvine, CA, .818-UV) at 400 nm excitaUon 
wavelength. An average output of 86 ^W +/- 12 jiW was achieved at this wavelength 
with a bandwidth of 6.6 nm. Background fluorescence spectra were measured with 
the probe dipped in a non-fluorescent bottle containing distilled water. This 
background HEM was subtraaed from all subsequently acquired EEMs to conect for 
room lights and probe autofluorescence. The non-uniform spectral response of the 
system was corrected using correction factors determined from noeasurements of 
caUbration sources; in the visible a N.LS.T traceable ningsten ribbon fllament lamp 
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and in the UV a deuterium lamp were used (550C and 45D, Optronic Laboratories 
Inc., Oriando, FL). Variations in the intensity of fluorescence excitation liglii source 
at different excitation wavelengths were corrected using measurements of the intensity 
at each excitation wavelength at the probe tip using a caUbrated photodiode (818-UV, 
Newport). Background spectra to correct reflectance measurements for room light 
contributions were measured with all panuneten set as for tissue measurements 
except the white Ught shutter was closed. These measurements were subtracted from 
all subsequent reflectance spectra. 

Fluorescence and reflectance standards were measured before each patient 
measurement The fluorescence intensity was reported relative to the fluorescence 
intensity of a solution of 2 mg/L Rhodaminc 610 (Exciton. Dayton, OH) in ethylene 
glycol at 460 nm excitaUon and 580 nm emission. Reflectance data are reported 
relative a 2.68% by volume solution of 1.072 micron diameter polystyrene 
microspheres (Polyscience Inc., Warrington. PA^jThe microsphere standard was used 
for its weU-characterized optical properties. Th^ total integrated reflectance of this 
standard was measured on a double beam spec^photometer (U-3300 Hitachi, Tolg^o, 
Japan) with an integrating sphere attachment (Labspher? fac. North Sutton, NH). This 
was used to correct the reflectance standard measurements made with the FastEEM 
system. Tissue spectra at each collection fiber position were divided pointwise by the 
corrected standard reflectance spectrum at the corresponding fiber position. 

The EEMs were assembled offline from each series of fluorescence emission 
scans. Data processing and plotting were performed with Matlab. (The Math Works 
Inc., Natick, MA). Reflectance spectra were assembled, from three wavelength areas 
giving a range from 380 to 950 nm. The wavelength range was further reduced (380 - 
800 nm) to comply widi the range of calibration measurements of the reflectance 
standards on the U-3300. Reflectance data were reported between 380 and 595nm. a 
range where the possible influence of room lights in the measurement was mininiized. 
System Validation 

System performance was assessed using two fluorescence standards. The first 
standard was a 2 mg/L Rhodaminc 610 (Exciton Inc., Dayton, OH) ethylene glycol 
solution that is non-scattering, but has peak fluorescence intensity approximately 
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twice ihc average intensity of human cervix. . The second standard mimics the optical 
properties of tissue and consists of 20 jiM Flavin Adenine Dinucleotide (FAD, Kodak, 
Rochester, NY), 0.625 vol% polystyrene micro spheres (Polyscience Inc., diameter = 
1.072 pm). 

Both standards were measured with the FastEEM system 10 and a scanning 
spectronuorimeter (SPEX, Fluorolog II, Edison, NJ). The EEMs measured with the 
SPEX were considered as standards since the performance of the system is well 
documented (dynamic rangeslO*, spectral resolution 5 nm, corrected for non-uniform 
spectral response). The excitation light was incident perpendicular to the sampling 
cuvette and the emitted Ught was collected at approximately a 20 degree angle with 
respect to excitation light A front focus arrangement with a 10 mm cuvette was used 
in the SPEX. 60 minutes were required to coUect a full EEM from each sample with 
the SPEX. 

Clinical Studies * . 

In vivo data were obtained from a group of patients with a known or suspected 
premalignant or malignant lesions of the oral cavity. The studies were reviewed and 
approved by the Internal Review Board of the University of Texas at Austin and the 
Surveillance Conunittee at the UT MD Anderson Cancer Center (Houston). Informed 
consent was obtained from each person in the study. Before using the probe, it was 
disinfeaed with Metricide (Metrex Research Corp.) in accordance with the standard 
clinical protocol. Background fluorescence EEM and reflectance spectra were 
measured by dipping the fiber optic probe in a non-flubrescent bottle filled with 
deionized water. These EEMs and spectra correspond to the system autofluorescence, 
and were subtracted from all subsequently acquired EEMs for that patient Next an 
EEM was measured from a Rhodamine calibration standard and a reflectance 
specmim was measured from a polystyrene solution calibration standard. The probe 
was then guided to the tissue site to be examined and its tip positioned flush with the 
tissue. A fluorescence EEM and reflectance specu-a were obtained from sites within a 
lesion and a clinically normal site. Post-specuoscopy, a 2-4 mm biopsy of the tiissue 
was taken from normal and abnormal sites where the probe measured spectra. These 
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specimens were evaluated by an experienced patbologist. Bonnie Kemp, M.D.. using 
light microscopy and classified using standard diagnostic criteria. 

Data Analysis 

One of the goals of the Fast EEM instrument 10 is to provide information for 
the identification of excitation wavelengths suitable for the differentiation of tissue of 
differing pathological characteristics, as weU as identification of tiie chromophores 
responsible for the differences. While all such infomiation is present in the EEMs 
collected, it can be difficult to extract due to the dimensionality of the data set. A 
method was devised to separately characterize the excitation and emission 
characteristics of the data set 

Given that the EEM has dimensions corresponding to (X,. the foUowing 
autocorrelation vectors are defined: 

m„(xJ = 5:,'!,EEM(x^.X.>EEM^...xJ ;,. 

where x,y(XO is the excitation autoconelation vector and m^Xn) is the emission 
autocorrelation vector. Essentially, the emission autocorrelation vector is the diagonal 
of the product of the EEM witii its transpose, and the excitation autocorrelation vector 
is the diagonal of the product of die ti^spose of die EEM with the EEM. Note that in 
signal processing terms, the autocorrelation vectors. x,v and lUav, are a measure of die 
average signal of die EEM at each excitation or emission wavelengtii, respectively. In 
tills way tiiey provide qualitative information about and EEM. 

An example with simulated data is presented in FIGS. 16A and 16B to 
iUustiate how autocorrelation vectors reflect changes in fluorescence peak positions in 
EEMs. Two kinds of changes are simulated in die modeled data: a shift in Uie 
excitation wavelengtii at which a fluorescence peak appears, and a shift in the 
emission wavelength at which a fluorescence peak appears. The original peak in the 
EEM was modeled as a single gaussian at 380 nm excitation. 550 nm emission widi a 
FWHM of 35 nm in emission and excitation wavelengtiis. The original peak was dien 
shifted by 30 nm in excitation as shown by arrow 1 in FIG. 16A. The shift in 
emission wavelengtii is shown by arrow 2 in HG. 16A, and corresponds to a 30 nm 
shift in die emission peak of die original data. . Three sets of autocorrelation vectors 
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were computed: one for the EEM with the original peak, one for the EEM with the 
excitation wavelength-shifted peak, and one for the EEM with the emission 
wavelength-shifted peak. The autocorrelation vectors are shown in FIG. 16B. 
Comparing the vectors for the original EEM (row 1 in FIG. 16B) with the vectors 
from the EEM with the excitation wavelength-shifted EEM (row 2 in FIG. 16B), it is 
seen that the excitation autocorrelation vector is sensitive to the change in excitation 
wavelength but not in emission wavelength. Similarly, comparing the autocoiiclation 
vectors for the original EEM with the vectors from the EEM with the emission 
wavelength shift in the peak (row 3 in HG. 16B) shows that the emission 
autocorrelation vector is sensitive to the changes in emission wavelength but not 
excitation wavelength. 

It is sometimes desirable to normalize the autocorrelation vectors to facilitate 
comparisons between different sets of measurements. Normalized autocorrelation 
vectors have been calculated by dividing these .vectors by their RMS value, in effect 
forcing the area of the vector to one unit of signal enagy. The normalized emission 
autocorrelation vector is well suited for the identification of differential features in 
EEMs. such as the shifting or broadening of fluotjescence. peaks. 

Results and Discussion: 

FIGS. 17A and 17B show fluorescence EEMs of the non-scattering 
Rhodamine standard and the scattering FAD phantom obtained widi a FastEEM 
system 10. Intensities are reported relative the Rhodamine intensity measured at 460 
nm excitation and 580 nm emission wavelength. FIGS. 18A and 18B show 
fluorescence emission spectra of the Rhodamine standard obtained at 370 and 450 nm 
excitation with the SPEX and the FastEEM system 10 as well as the fluorescence 
background. FIG. 18B and 18D show the same spectra for scattering FAD phantom 
obtained at the same excitation wavelengths, the spectra are normalized at their 
maximum. Note the presence of Rayleigh scatteri'ng peaks from the excitation source 
in the data taken with the SPEX. In general, from non-scattering samples (FIG. ISA, 
18Q the FastEEM system 10 collects less light above 600 nm than the SPEX. This 
may be due to the different collection efficiencies of the FastEEM probe and the front 
face coUecUon geometry of the SPEX. Under scattering conditions and with lower 
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fluorescence signal, the influence of background fluorescence becomes more critical. 
At 370 nm excitation wavelength the FastEEM system 10 measures more fluorescence 
below 500 mn. A comparison with the measured fluorescence background however 
shows that the additional signal has the same shape as the background. It has been 
hypothesized that the background may have been underestimated by measuring it in a 
oon-scanering non-fluorescent media. 

In -vivo fluorescmce EEMs of the oral cavity were measured from 7 1 sites and 
in-vivo reflectance spectta were measured from 49 sites. These were obtained ftom 
patients in two smdies. The first study included patients with abnormal oral lesions 
identified in a previous medical examination (17 patients). The second study, 
conffibuting nine patients, was of normal volunteers. All sites interrogated 
spectroscopically in patients with lesions were biopsied and submitted for 
histopathological analysis. Spectra and biopsies were also obtained from a 
conttalateral site with no lesion in these patients with abnormal lesions. These 
biopsies were also evaluated histopathologically. . No biopsies were taken from the 
normal volunteers. In this Bxamplt, the inventors, show representative EEMs from 
tissue found to be histopathologically nomial. and rnalignant to illustrate spectral 
features detectable with the FastEEM system. : 

Two EEM contour plots fix)m a nonmal and an abnormal area of the tongue are 
presented in FIGS. 19A and 19B. respectively. In the normal sample, fluorescence is 
observed diroughout the whole collection range, with a peak located at 330/380 
(excitation/emission) and a ridge extending from 340/450 to 450/500. Table II lists 
excitation-emission maxima pairs of endogenous tissue chromophores. Comparison 
of the observed peaks with Table n shows these peaks are consistent with the 
emission of strucniral proteins such as collagen and elastin, pyridine nucleotides 
(NADH) and flavoproteins (FAD). The noimal site shows overaU increased 
fluorescence with respect to the abnormal site shown in FIG. 19B. The abnormal site, 
assessed by a pathologist as being moderately differentiated squamous cell carcinoma, 
also shows broad fluorescence throughout Peaks are observed at 330/380. 350/460, 
460/520 and 500/630. A valley is seen at 420 nm excitation between 560 and 580 
emission. This valley is seen to extend along the 420 nm excitation line as well as the 
580 nm emission line. Table m suggests that these features are produced by 



wo 99/57529 . PCT/US99/09768 

36 

hemoglobin reabsoiption. Hemoglobin rcabsorpiion may also in part account for the 
shift in the peaks of the abnormal EEM relative to. the normal EEM. A summary of 
the excitation and emission maxima for the peaks observed in the normal and 
abnormal sites measured is presented in Table IV. 

Fluorescence emission spectra at three selected excitation wavelengths are 
shown in FIG. 20. illustrating changes in relative intensities of fluorescence emission. 
For comparison purposes each set (normal/abnormal) was normalized to the 
maximum at 350 nm excitation. FIG. 20(a) shows the emission spectra at 350 nm 
excitation. Fluorescence from the normal site is seen as z broad peak with a maximum 
at 455 nm. The peak from the abnormal site is . seen to be narrower and red-shifted. 
Examination of this spectrum a 410. 540 and 580 nm suggests that the change in 
lineshape is due to oxygenated hemoglobin. The general line shapes of the 
ftuorescence observed at 410 nm excitation (FIG, 20(b)) are seen to be similar for 
both sites in the 450-575 nm excitation range^.iwih a broad peak at 500 nm. The 
abnormal site shows a significantly lower fluoiescence intensity, as well as an extra, 
narrow fluorescence peak at 640 nm, attributed to porphyrin fluorescence. FIG. 20(c) 
shows the emission spectra at 460 nm excitation. The normal site shows a broad peak 
at 520 nm and clear modulation from hemoglobin reabsojrption at 540 and 580 nm. 
Fluorescence from the abnormal site shows an even more marked hemoglobin 
reabsorption; also the overall fluorescence intensity is reduced. 

FIGS. 21A and 21B show the emission and excitation autocorrelation vectors 
for the same measurements. Note that the plots have a logarithmic y-axis. The 
emission autocorrelation vectors have a large broad peak at 460 nm corresponding to 
the main fluorescence peak observed in the EEMs. The vectors show the effect of 
hemoglobin absorption around 410. 540 and 580 nm in the abnormal site and the 
presence of additional fluorescence in the UV in the normal sample (FIG. 21A). This 
autocorrelation vector also highlights the peak at 610 nm in the abnormal sample. 
The excitation autocorrelation vectors show different line shapes. The curve 
corresponding to the normal site decreases steadily from 330 nm to 500 nm excitation. 
The curve from the abnormal site shows a peak at 350 nm and a minimum at 410 nm. 
The latter illustrates the greater influence of hemoglobin reabsorption in the abnormal 
sample also shown in FIG. 20. 
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The corresponding reflectance data is shown in FIGS. 22A-22C. Position 1 
corresponds to the collection fibers closest to the source fiber and position 3 to those 
furthest from the source fiber as shown in FIG. 15. The difference in position allow 
for spatially resolved reflectance measurements. Differences induced by the 
fluorescence reabsorption of oxygenated hemoglobin in the normal site and abnormal 
site arc shown. The modulation of the spectrum, by the 540 and 580 absorption bands 
is seen to be significantly stronger in the abnormal san^le; this is consistent with the 
increased reabsorption seen in the fluorescence, spectrum of the abnormal sample. 
The reflectance in the blue range (450-500nm) of the abnormal site is consistently 
higher than that of the normal site. Below 450 nm the reflectance seems not to differ 
between the normal and abnormal samples. 

Conclusions 

The total data acquisition time for the data presented here was 2.5 minutes for 
a fluorescence EEM. and 1.5 minutes for the spatially resolved reflectance 
measurements. However, only 29 seconds of this time represented fluorescence 
collection. Actual reflectance collection time was 26 seconds. The most time 
consuming process was changing the excitation wavelength using the stepper motor 
conu-olled excitation specu-ograph and changing the corresponding long-pass filter 
using the remotely controUed filter wheel. Worm drive based monochromaiors are 
available (DDD180. ISA) which require less than 10 seconds to scan our entire 
wavelength range in 10 nm steps, and could substantially reduce the total 
measurement time. Using a higher power lamp may further reduce acquisition time of 
both fluorescence and reflectance. 

This Example has demonstrated the acquisition of EEMs in combination with 
spatially resolved reflectance measurements of tissue phantoms and in the oral cavity 
in vivo with good signal to noise ratio. The system features easy and arbiu-ary 
selection of excitation wavelengths in the UV and visible range. The system is also 
portable, and capable of fiinctioning in a hospital operating room. Probes used in the 
Fast HEM system incorporate channels to measure spatially resolved reflectance and 
fluorescence, and are built small enough (Less !than about 5mm) to be used during 
endoscopic surgical procedures. Autocorrelation vectors Xav and m,y are a suitable 
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method to reduce the data set while preserving information about the wavelength 
bands caxiying information. Based on the representative data shown here, fluorescence 
emission and excitation as well as reflectance data appear promising for the 
identification of tumors of the oral cavity. The Fast EEM system is an ideal tool to 
identify a subset of the most promising optica] features to identify pathological 
findings in large clinical studies. 
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EXAMPLE 2 

Cervical Pre-Cancer Detection Using A Multivariate Statistical Algorithm Based On 
Laser Induced Fluorescence Spectra At Multiple Excitation Wavelengths 

A portable fluorimeter was developed and utilized to acquire fluorescence 
spectra from 381 cervical sites in 95 patients at 337, 380 and 460 nm excitation 
immediately prior to colposcopy. A multivariate statistical algorithm was used to 
extract clinically usefiil information fxom tissue spectra acquired in vivo. Two full- 
parameter algorithms were developed using tissue fluorescence emission spectra at all 
three excitation wavelengths (161 excitation-emission wavelength pairs) for cervical 
pre-cancer (squamous intraepithelial lesion (SE<)) detection: a screening algorithm 
which discriminates between SILs and non SILs with a sensitivity of 82%± 1.4 and 
specificity of 68%±0.0. and a diagnostic algorithm which differentiates high grade 
SILs ftom non high grade Stt-s with a sensitivity and specificity of 79%±2 and 
78%±6, respectively. Multivariate statistical analysis was also employed to reduce the 
number of fluorescence excitation-emission wavelength pairs needed to re-develop 
algorithms that demonstrate a minimum decrease in classification accuracy. Two 
reduced-parameter algorithms which employ fluorescence intensities at only 15 
excitation-emission wavelength pairs were developed: the screening algorithm 
differentiates SILs from non SILs with a sensitivity of 84%±1.5 and specificity of 
65%±2 and the diagnostic algorithm discriminates high. grade SILs from non high 
grade SILs with a sensitivity and specificity of 78%.±0.7 and 74%±2, respectively. 
Both the fiill-parameter and reduced-parameter screening algorithms discriminate 
between SILs and non SILs with a similar specificity (±5%) and a substantially 
improved sensitivity relative to Pap smear screening. A comparison of the fiill- 
parameter and reduced-parameter diagnostic algorithms to colposcopy in expert hands 
indicated that all three have a very similar sensitivity and specificity for differentiating 
high grade SILs from non high grade SILs. 

This paper presents the development and ^plication of a detection technique 
for human cervical pre-cancer based on laser induced fluorescence spectroscopy. A 
portable fluorimeter consisting of two nitrogen pumped-dye lasers, a fiber-optic probe 
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and a polychromaior coupled to an optical multi-channel analyzer was utilized to 
acquire fluorescence spectra from 381 cervical sites in 95 patients ai three excitation 
wavelengths: 337, 380 and 460 nm. A general multivariate statistical algorithm was 
then used to analyze and extract clinically useful information from tissue spectra 
acquired in vivo. First, a screening algorithm was developed to discriminate between 
SILs and non SJLs (normal squamous and columnar epithelia and inflanunation); 
second, a diagnostic algorithm was developed to differentiate HG SILs from non HG 
SILs (LG SILs, normal q)ithelia and inflammation). The retrospective and prospective 
accuracy of both the screening and diagnostic algorithms were compared to the 
accuracy of Pap smear screening and to colposcopy in expert hands. 

The general multivariate statistical algorithm was initially developed and 
tested using cervical tissue spectra acquired at 337 nm excitation from 476 cervical 
sites in 92 patients. This algorithm could be used to differentiate SILs and normal 
squamous tissues with an average sensitivity and specificity of 91%±2 and 78%±3, 
respectively. A limitation however is that spectra of normal columnar tissues and 
inflanunation were indistinguishable from those, of SILs at this single excitation 
wavelength. Furthermore, a multivariate statistical algorithm based solely on spectra 
at 337 nm excitation could not discriminate between . HG SILs and LG SILs 
effectively. 

However, multivariate statistical analysis of cervical tissue fluorescence 
specura acquired in vivo at 380 nm and 460 nm excitation fix>m a subset of the 92 
patients indicated that specu^ at these excitation wavelengths can overcome the 
limitations of specu^ at 337 nm excitation. Spectra at 380 nm excitation from 165 
sites in a first group of 40 patients could be used to differentiate SILs from normal 
columnar epithelia and inflammation with a sensitivity and specificity of 77%± 1 and 
72%±9, respectively; spectra at 460 nm excitation from 149 sites in a second group of 
24 patients could be used to differentiate HG SILs from LG SILs with a sensitivity 
and specificity of 80%±4 and 76%+5, respectively. 

The results from previous clinical studies suggested that an algorithm based on 
normalized, mean-scaled spectra at 337 nm excitation may be used to differentiate 
between SILs and normal squamous tissues, while an algorithm based on similarly 
pre-processed specua at 380 nm excitation may be used to differentiate SILs from 
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normal columnar tissues and samples with inflammation. Fmally, a third algorithm 
based on normalized tissue spectra at 460 nm cxciuiidn may be used to discriminate 
between LG SILs and HG STLs. These results suggest that (1) a composite screening 
algorithm based on a combination of the first two constituent algorithms may be used 
to differentiate between SILs and non SILs (normal cpithelia and inflammation) and 
(2) a composite diagnostic algorithm which combines all three constituent algorithms 
may be used to differentiate HG SILs from non HG SILs (LG SILs. normal tissues and 
mflammation). 

The primary goal of the clinical smdy described in this Example was to 
evaluate the accuracy of constituent and composite algorithms which address certain 
limitations of previous clinical smdies. Fluorescence spectra acquired in vivo at all 
three excitation wavelengtiis from 381 cervical sites in 95 patients were analyzed to 
determine if the accuracy of each of the Uuee constituent algorithms previously 
developed may be improved using tissue spequa:;at a combination of two or three 
excitation wavelengths ratiier tiian at a single excitation wavelengtii. A second goal of 
the analysis was to integrate the three independeritiy developed constituent algorithms 
tiiat discriminate between pairs of tissue types into composite screening and diagnostic 
algorithms that may achieve discrimination between many of Uie clinically relevant 
tissue types. The effective accuracy of a composite, screening algoritiim for Uie 
identification of SBLs and a composite diagnostic algorithm for the identification of 
HG SILs was evaluated. 

The final goal of the analysis was to determine if fluorescence intensities at a 
reduced number of excitation-emission wavelengtii pairs may be used to re-develop 
constituent and composite algoriUims tiiat may achieve classification witii a minimum 
decrease in predictive ability. A significant reduction in tiie number of required 
fluorescence excitation-emission wavelengtii pairs may enable tiie development of a 
cost-effective clinical fluorimeter. The accuracy: of the constituent and composite 
algoritiuns based on the reduced emission variables was compared to Uie accuracy of 
those that utilize entire fluorescence emission spectra. 
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Instrumentation 

A schematic of the portable fluorimctcr which was used lo acquire cervical 
tissue fluorescence spectra at three excitation wavelengths is shown in FIG. 23(a). The 
fiber-optic probe (Valdor Fiber Optics, VSC/FER/4SMA-1/7-BUN) included a central 
fiber surrounded by a circular array of six fibers; all seven fibers having the same 
characteristics (0.22 NA, 200 jrni core diameter, 245 jun diameter with cladding). 
Three fibers along the diameter of the distal end of the probe (FIG. 23(b)) were used 
for excitation light delivery. The purpose of the remaining four fibers was to collect 
the emitted fluorescence from the area directly illuminated by the probe. A quartz 
shield (3 mm in diameter and 2 nrni thick) at the tip of the distal end of the probe that 
is in direct Ussue contact (FIG. 23(c)) provided a fixed distance between the optical 
fibers and the tissue surface so fluorescence intensity can be measured in calibrated 
units. 

An area, 1 mm in diameter was illuminated by each excitation fiber. The 
overlap of the illumination area viewed by the three excitation fibers and the four 
collection fibers was approximately 80% at the outer surface of the quartz shield. Note 
that the central excitation fiber has four adjacent collection fibers whereas the two 
excitation fibers in the periphery of tiie probe have only two adjacent collection fibers 
(FIG. 23(b)). However, due to tiie large overlap of the optical fibers at tiie outer face 
of the quartz shield, tiiis difference in the excitation-emission configuration relates 
only to a small difference in the collection efficiency of tiie fluorescence generated 
due to excitation delivered by the central and peripheral excitation fibers. The 
diflfercnce in coUection efficiency is accounted fw. by normalizing tissue fluorescence 
spectra to the peak fluorescence intensity of a Rhodamine 610 calibration standard 
measured using the same probe configuration. 

Two nitrogen pumped-dye lasers (laser characteristics: 5 ns pulse duration, 30 
Hz repetition rate) (Laser Photonics, LN300C) were used to provide illumination at 
tiiree different excitation wavelengOis: one laser served to deliver excitation light at 
337 nm (fimdamental) and had a dye module which was used to generate light at 380 
nm using tiie fluorescent dye, BBQ (lE-03 M in.7 parts toluene and 3 parts etiianol). 
The dye module of die second laser was used to provide illumination at 460 nm. using 
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the fluoiescent dye. Coumarin 460 (1B02 M in ethanol). Laser illumination at each 
excitation wavelength. 337, 380 and 460 nm was coupled into each of the three 
excitation fibers of the probe. Note that two 10 nm bandpass filters, one centered at 
380 nm and the other centered at 460 nm were placed between the excitation fiber and 
the dye module used to generate illumination at 380 and 460 nm. respectively to 
prevent leakage from the fundamental at 337 nm. In Uiis Example, the average fluence 
per pulse at 337. 380 and 460 nm excitation were 15.2, 11.5 and 18 jJ/mm'. 
respectively. The pulse energy at 337 nm excitation was intentionaUy reduced so that 
tfie measured fluorescence signal did not exceed the dynamic range of the detector. 

The proximal ends of the four coUection fibers were arranged in a circular 
array and imaged at the 500 pm wide entrance slit of a ^3.8 spectrograph equipped 
with a 300 ta/mm grating (JarreU Ash, Monospec 18) coupled to a 1,024 intensified 
diode array coniroUcd by a multi-channel analyzer (Princeton Instruments, OMA). 
The collection optics between the proximal end/of ihe four emission coUection fibers 
and the polychromator included two quartz piano convex lenses. Between Uiese lenses 
was a filter wheel assembly containing long pass filters widi 50% transmission at 360 
(GG360). 400 (GG400) and 475 (GG475) nm which are used to block scattered 
excitation tight at 337, 380 and 460 nm excitation, respectively from the detector. The 
purpose of the filter wheel was to position the appropriate long pass filter in the 
optical path during fiuorescence measurements at each excitation wavelengtij. The 
nitrogen pumped-dye lasers were used to externally trigger a pulser (Princeton 
Instruments, PG200) which served to synchronize tiie 200 ns collection gate of the 
detector to tiie leading edge of tiie laser pulse. The gating of the detector eliminated 
the effects of tiie colposcope's white Ught .illumination during fluorescence 
measurements. Data acquisition was computer controlled. 

CUnical measurements 

A randomly selected group of non-pregnant patients referred to tiie colposcopy 
clinic of die University of Texas MD Anderson Cancer Center on tiie basis of 
abnormal cervical cytology was asked to participate in tiie in vivo fluorescence 
spectroscopy study. Informed consent was obtained from each patient who 
participated and the study was reviewed and approved by the histitutional Review 
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Boards of the University of Texas. Austin and the University of Texas, MD Anderson 
Cancer Center. Each patient underwent a complete history and a physical 
examination including a pelvic exam, a Pap smear and colposcopy of the cervix, 
vagina and vulva. After colposcopic examination of the cervix, but before Ussuc 
biopsy, fluorescence spectra were acquired on average from two colposcopically 
abnormal sites, two colposcopically normal squamous sites and 1 normal coluimiar 
site (if colposcopicaUy visible) from each patient Tissue biopsies were obtained only 
from abnormal sites after they had been identified by colposcopy and then analyzed by 
the probe. Tissue biopsies were not obtained from normal squamous or columnar sites 
analyzed by the probe to comply with routine patient care procedure. All tissue 
biopsies were fixed in fonnalin and submitted for histologic examinaUon. 
Hemotoxylin and eosin stained secUons of each biopsy specimen were evaluated by a 
panel of four board certified pathologists and a consensus diagnosis was established 
using the Bethesda classification system. . This - classification system which has 
previously been used to grade cytologic specimens has now been extended to 
classification of histology samples. Samples weje classified as normal squamous, 
normal columnar, inflammation, LG SIL or HQ:§JL Samples with multiple diagnoses 
were classified into the most severe hlsto-pathologic category. 

Prior to each patient study, the probe was disinfected and a background 
spectrum was acquired at all three excitation wavelengths consecutively with the 
probe dipped in a non-fluorescent botUe containing distilled water. The background 
spectrum indicated no fluorescence due to optical components of the fluorimctcr or 
the disinfectant and was subtracted from all subsequently acquired spectra at 
corresponding excitation wavelengths for that patient. Next, with the probe placed on 
the face of a quartz cuvette containing a solution of Rhodamine 610 dissolved in 
ethylene glycol (2 mg/L), 50 fluorescence spectra were measured at each excitation 
wavelength. After calibration, fluorescence spectra were acquired from the cervix: 10 
spectra for 10 consecutive pulses were acquired.at 337 nm excitation; next. 50 spectra 
for 50 consecutive laser pulses were measured at 380 ran excitation and then at 460 
nm excitation. The data acquisition time was 0.33 s at 337 nm excitation and 1.67 s at 
each 380 and 460 nm excitation per cervical site. The time required to switch between 
tile two nitrogen pumped-dye lasers and tiie three long pass filters was approximately 
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5 s. Hence, the total time required to record fluorescence emission spectra at all three 
excitation wavelengths from one cervical siie was approximately 10 s. Spectra were 
collected in the visible region of the electromagnetic specumn with a resolution of 10 
nm (full width at half maximu m ) and a signal to noise ratio of 100:1 at the 
fluorescence maximimi at each excitation wavelength. 

All spectra were corrected for the non-uniform spectral response of the 
detection system using correction factors obtained by recording the spectrum of an 
N.LS.T traceable calibrated tungsten ribbon filament lamp; Spectra from each cervical 
site at each excitation wavelength were averaged to obtain a single spectram per site. 
The fluorescence spectra obtained at each excitation wavelength from the Rhodamine 
610 calibration standard were also averaged to obtain a single spectrum per excitation 
wavelength. The average tissue spectra were then normalized to the average peak 
fluorescence intensity of the Rhodamine 610 calibration standard at the coirespondmg 
excitation wavelength for that patient; absolute fluorescence intensities are reported in 
these calibrated units. In this clinical study, fluorescence spectra were acquired at all 
three excitation wavelengths from each cervical-site from a total of 381 sites in 95 
patients during colposcopy. 

Development of screening and diagnostic algorithms 

FIG. 24 illustrates a schematic of the formal analytical process used to develop 
screening and diagnostic algorithms for the differential detection of SILs. in vivo. In 
FIG. 24. the text in the dashed-line boxes represdit the mathematical steps 
implemented on the spectral data, and the text in the solid-line boxes represent the 
output after each mathematical process. There are four primary steps involved in the 
multivariate statistical analysis of tissue specu^ data (FIG. 24). The first step is to 
pre-process spectral data to reduce inter-patient and intra-patient variation within a 
tissue type; the pre-processed specu-a are then dimensionally reduced into an 
informative set of principal components that describe most of the variance of the 
original spectral data set using Principal Component Analysis (PGA). Next, the 
principal components that contain diagnostically relevant information are selected 
using an unpaired, one-sided student's t-test, and finally a classification algorithm 
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based on logisUc discriminaiion is developed using these diagnostically relevant 
principal components. 

In summary, three consHtuent algorithms were developed using multivariate 
staiisu^al analysis (Fig. 24): constituent algorithm (1) discriminates between SILs and 
normal squamous tissues, constituent algorithm (2) discriminates between SJLs and 
normal columnar tissues and finally, algorithm (3) differentiates HG SILs from LG 
SLs. The three consHtuent algorithms were then combined to develop two composite 
algorithms (Rg. 24): constituent algorithms (1) and (2) were combined to develop a 
composite screening algorithm which discriminates between SILs and non SJL&. AU 
three constituent algorithms were then combined to develop a composite diagnostic 
algorithm which differentiates HG SILs from non HG SILs. 

Multivariate statistical analysis cf cervical tissue spectra 

As a first step, three methods of pre-processing were applied to the spectral 
data at each excitation wavelength: (1) normalization (2) mean-scaling and (3) a 
combinaUon of normalization and mean-scaling. Similarly pre-processed spectra at 
each excitation wavelength were combined to create speciral inputs at the following 
combinations of excitation wavelengths: (337. 460) nm. (337. 380) nm. (380. 460) am 
and (337, 380. 460) nm. Pre-processing of spectral data resulted in four types of 
spectral inputs (original and three types of pre-processed spectral inputs) at three 
single excitation wavelengths and at four possible combinations of multiple excitation 
wavelengths. Hence, there were a total of 12 sjpectral inputs at single excitation 
wavelengths and 16 spectral inputs at multiple excitation wavelengths which were 
evaluated using the multivariate statistical algorithm. 

Prior to PCA, the input data matrix, D (r'x c) was created so each row of the 
matrix corresponded to Uie pre-processed fluorescence spectrum of a sample and each 
column corresponded to Uie pre-processed fluorescence intensity at each emission 
wavelength. Spectral inputs at multiple excitation wavelengtfis were created by 
arranging spectra at each excitation wavelengtii in series in Uie original spectral data 
matrix. PCA was used to dimensionally reduce tiie pre-processed spectral data matrix 
into a smaller orthogonal set of linear combinations of tiie emission variables that 
account for noost of the variance of the spectral data set. 
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Average values of principal component scores were calculated for each 
principal component of each tissue type. An unpaired, one-sided smdenfs t-test was 
eix^>loyed to determine the diagnostic content of each principal component. The 
hypothesis that the means of the principal component scores of two tissue types are 
different was tested for (1) normal squamous epithelia and SILs, (2) normal columnar 
epithelia and SILs and (3) inflammation and SEjs. The t-test was extended a step 
further to determine if there were any statistically significant differences between the 
means of the principal componoit scores of HG SILs and LG SILs. Principal 
CQii^nents for which the hypothesis stated above was statistically significant (P < 
O.OS) were retained for further analysis. 

Next, a statistical classification algorithm was developed using the 
diaposucally useful principal components to calculate the posterior probabiUty that 
an unknown sample belongs to each tissue type under consideration. The posterior 
probability of an unknown sample belonging to each tissue type was calculated using 
logistic discrimination. The posterior probability is related to the prior and conditional 
joint probabilities and to the costs of misclassification of the tissue types under 
considwation. The prior probability of each tissue type was determined by calculating 
the observed proportion of cases in each group; Ths cost of misclassification of a 
particular tissue type was varied fiom 0 to 1 in 0,1 mcrements. and the optimal cost 
was identified when the total number of misclassified samples based on the 
classification algorithm was a minimum. If there :was more than one cost ai which the 
total number of misclassified samples was a minimum, the cost that maximized 
sensitivity was selected. The conditional joint probabilities were developed by 
modeling the probability distribution of each principal component of each tissue type 
using the normal probabiUty density function, which is characterized by p. (mean) and 
o (standard deviation). The best fit of the normal probability density fimction to the 
probability distribution of each principal component (score) of each tissue type was 
obtained in the least squares sense, using n and <t as free parameters of the fit. The 
normal probability density function was then used to calculate the conditional joint 
probability that an unknown sample, given that it is irom tissue type i, will exhibit a 
set of principal component scores, X. 
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. The multivariate statistical algorithm was developed and optimized using a 
calibration set and then tested on a prediction set of approximately equal prior 
probability (Table I). The purpose of testing the algorithm on the prediction set was to 
determine (1) an unbiased estimate of Uie algorithm's classification accuracy and (2) if 
die number of sample spectra wiUiin each category in the calibration set is sufficient 
to describe the spectral data in the prediction set. The caUbriation and prediction sets 
were developed by randomly assigning the spectral data into the two sets witii the 
condition that both contain roughly equal numbbr of samples from each histo- 
paUiologic category. The random assignment ensured tiiat not all spectra from a single 
patient were contained in the same data set. 

Development of constituent algorithms 

The multivariate statistical algoritiun was. developed and optimized using all 
28 types of pre-processed spectral inputs from die calibration set. The algorithm was 
used to identify spectral inputs which provide the. greatest discrimination between the 
foUowing pairs of tissue types: (1) SILs and normal squamous epiUieUa, (2) SILs and 
normal columnar epitiieUa. (3) SILs and inflainmation. and (4) HG SILs and LG SILs. 
The optimal specu-al input for differentiating between two particular tissue types was 
identified when tbt total number of samples misclassified from die calibration set 
using tije multivariate statistical algoritiun was a minimum, the algoritiun based on 
Uie spectral input diat minimized misclassification between the two tissue types under 
consideration was implemented on die prediction data set. 

Three multivariate statistical constituent algorithms were developed using 
tissue spectra at tiiree excitation wavelengths. Constituent algoritiun (1) was 
developed to differentiate between SILs and nonnal squamous epitiielia; constituent 
algoritiun (2) was developed to differentiate between SILs and normal columnar 
epitiielia and constituent algoritiun (3) could be used to discriminate between LG SILs 
and HG SILs. A constituent algoritiun which can discriminate between SILs and 
tissues witii inflammation could not be developed using spectral data from tiie current 
clinical study. 
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Development of composite algorithms 

Each of the independently developed constituent algorithms was intended to 
discriminate only between pairs of tissue types. A combination of these constituent 
algorithms was required to provide discrimination between several of the clinically 
relevant tissue types. Therefore, two composite algorithms were developed: a 
composite screening algorithm was developed to differentiate between SILs and non 
SJL& (normal squamous and columnar epithelia and inflammation) using constituent 
algoritimis (1) and (2) and a composite diagnostic algoritiun was developed to 
differentiate HG SILs from non HG SILs (LG SILs. normal epithelia and 
inflammation) using all three constituent algoritiims. 

The composite screening algorithm was developed m the foUowing manner. 
First, constituent algoritimis (1) and (2) were developed independentiy using die 
caUbration data set. The classification outputs from both constituent algoridmis were 
used to determine if a sample being evaluated is SIL or non SIL: first, using 
constituent algoritimi (1), samples were classified as non SIL if tiiey had a probabiUty 
tiiat is less than OJ; otherwise, they were classified as SIL. Next, only samples that 
were classified as SIL based on the algorithm (1) were tested using algorithm (2). 
Again, samples were classified as non SEL if tfieir posterior probability was less tijan 
0.5; oUierwise Uiey were classified as SIL The spectral data from the. prediction set 
was evaluated using the composite screening algoritiun in an identical manner. 

The composite diagnostic algoritiun was implemented in die foUowing 
manner. The Uiree constituent algoritiuns were developed independentiy using die 
caUbration set. Algorithms (1) and (2) were implemented on each sample from tiie 
calibration data set, as described previously. Only samples tiiat were classified as SIL 
based on algoritiims (1) and (2) were tested using algoritiim (3). If samples evaluated 
using algoritiun (3) had a posterior probability greater tiian 0.5, tiiey were classified as 
HG SIL\ otiierwise tiiey were classified as non HG SIL. The spectral data from tiie 
prediction set was evaluated using tiie comp<75i7c diagnostic algoritiim in an identical 
manner. 
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ResuJts. 

Constituent algorithms (1). (2) and (3) 

Table 2 summarizes the components of the optimal set of three constituent 
algorithms. Constituent algorithm (1) can be used to differentiate between SILs and 
nonnal squamous epithelia; algorithm (2) differcnUates between SILs and nonnal 
columnar epithelia and algorithm (3) discriminates between LG SILs and HG SILs. 
Pre-processing 

FIG. 25(a) illustrates average fluorescence spectra per site acquired from 
cervical sites at 337 nm excitation from a typical. patient AU fluorescence intensities 
are reported in the same set of calibrated units. Corresponding normalized and 
normalized, mean-scaled spectra are iUustrated in HO. 25(b) and 25(c). respectively. 
Evaluation of the original spectra at 337 nm excitation (Fig. 25(a)) indicates that the 
fluorescence intensity of SILs is less than that of the corresponding normal squamous 
tissue and greater than that of the corresponding nonnal columnar tissue over the 
entire emission spectrum. Examination of normalized spectra from this patient (Rg. 
25(b)) indicates that following normalization, the fluorescence intensity of the normal 
squamous tissue is greater tiian Uiat of corresponding SILs.oyer. die wavelength range 
360 to 450 nm only; between 460 and 600 nm. the fluorescence intensity of SILs is 
greater tiian that of Uie corresponding nonnal squamous tissue which in pan leflecls 
die longer peak emission wavelength of SILs. A comparison of the spectral line shape 
of SILs to that of the nonnal columnar tissue illustrates the .opposite phenomenon. The 
nonnalized fluorescence intensity of SILs is greater tiian tiiat of tiie conesponding 
normal columnar tissue over the wavelengdi range 360 to 450 nm; however, between 
460 and 600 nm. tije fluorescence intensity of die normal columnar tissue is greater 
than that of tiie SILs; dus spectral difference reflects tiie longer peak emission 
wavelength of the normal columnar tissue relative to tiiat of SELs. Further evaluation 
of nonnalized spectra in Fig. 25(b) indicates that there are spectral Une shape 
differences between LG SILs and HG SILs over die wavelength range 360 to 420 nm. 

The corresponding normalized, mean-scaled spectra of tiiis patient, shown in 
Fig. 25(c) displays differences in die nonnalized fluorescence spectrum (Fig. 25(b)) 
from a particular site widi respect to the average normalized spectrum (die average of 
all normalized spectra obtained from, dus patient). As die average normalized 
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specman has been subtracted from each nonnalized spectrum obtained from this 
patient, the mean now lies at Y=0 over the entire emission wavelength range. 
Evaluation of Hg. 25(c) indicates that between 360 and 450 nm. the normalized, 
mean-ibaled fluorescence intensity of the normal squamous tissue is greater than the 
mean, and that of the normal columnar tissue is less than the mean. Above 460 nm. 
the opposite phenomenon is observed; the fluorescence intensity of the nonnal 
squamous tissue is less than the mean. whUe that of the normal columnar tissue is 
greater than the mean. The fluorescence intensiQr of SE.S lies close to the mean and is 
bounded by the intensities of the two normal tissue types. In addition, between 360 
and 420 nm. the normalized, mean-scaled fluorescence intensity of the LG SIL is 
sUghtly greater than the mean, while Uiat of the HG SIL is less than tiie mean. 

HG. 26(a) iUustrates average fluorescence spectra per site acquired from 
cervical sites at 380 nm excitation, from the same patient. HG. 26(b-c) show the 
corresponding normalized, and normalized, mean-scaled. spectra, respectively. In Rg. 
26(a). the fluorescence intensity of SILs is less, than that of. the coiresponding normal 
squamous tissue, with the LG SIL exhibiting die weakest fluorescence intensity over 
Uie entire emission spectrum. Note that tiie fluorescence intensity of tiie normal 
columnar sample is mdistinguishable from that of tht HG SIL. Normalized spectra at 
380 nm excitation, (26(b)), indicate that over the wavelength range 400 to 450 nm. the 
fluorescence intensity of the normal squamous tissue is sUghUy greater than that of 
SILs and that of the normal columnar tissue is Jess than that of SILs. The opposite 
phenomenon is observed above 580 nm. A careful, examination of the spectra of tiie 
LG SIL and HG SIL indicates that between ,460 and 580 nm. tiie normalized 
fluorescence intensity of the LG SIL is higher tiian tfiat of the HG SJL The 
normalized, mean-scaled spectra (Fig. 26(c)) enhances tiie previously observed 
nonnalized spectral line shape differences by displaying Uiem relative to the average 
normalized spectrum of tiiis patient. Rg. 26(c) indicates tiiat between 400 to 450 nm. 
Uie fluorescence intensity of tiie normal squamous tissue is greater Uian tiie mean and 
tiiat of tiie normal columnar tissue is less than the mean. The opposite phenomenon is 
observed above 460 nm. The fluorescence intensity of tiie SILs is bounded by tiie 
intensities of tiie two normal tissue types over die entire emission spectrum. The LG 
SIL and HG SIL also show spectral line shape differences; above 460 nm. tiie 
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normalized, mean-scaled fluorescence intensity of the LG SIL Ues above the mean and 
that of the HG SIL lies below the mean. 

HG. 27(a-c) illustrate original, normalized and normalized, mean-scaled 
spectra,' respecuvely at 460 nm excitation from the same paUent. Evaluation of Fig. 
27(a) indicates that the fluorescence intensity of SILs is less than that of the 
corresponding normal squamous tissue and greater than that of the corresponding 
normal columnar sample over the entire emission spectrum. EvaluaUon of normalized 
spectra at this excitation wavelength (Fig. 27(b)) demonstrates that below 510 nm. the 
fluorescence intensity of SILs is less than that of the normal squamous tissue and 
greater than that of the corresponding normal columnar tissue. Above, 580 nm. tiie 
normalized fluorescence mtensity of SILs is less dian that of Uie normal columnar 
tissue and greater Uien that of normal squamous tissue. Note that Uieie are spectral 
line shape differences between tiie LG SIL and HG SIL between 580 and 660 nm; the 
normalized fluorescence intensity of the LG SIL. is greater than that of the HG SIL 
The nonnalized, mean-scaled spectra shown in Fig. 27(c) reflects the differences 
observed in the normalized spectra relative to the average normalized spectrum of tiiis 
patient. Below 510 nm. tiie fluorescence intensity of tiie normal squamous tissue is 
greater tiian tiie mean. whUe tiiat of tiie normal columnar tissue is less than tiie mean. 
Above 580 nm. tiie opposite phenomenon is observed. The fluorescence intensity of 
tiie SILs lies between diose of tfie two normal tissue types. Above 580 nm, tiic 
fluorescence intensity of tiie LG SIL is greater than the mean and tiiat of tiie HG SIL is 
less than the mean. . , 

Principal Component Analysis and Logistic Discrimination 

Constituent algorithm (1) which differentiates SILs from normal squamous tissues 

A constituent algoritiim based on normalized spectra arranged in series at all 
tiiree excitation wavelengtiis provided tiic greatest discrimination between SILs and 
normal squamous tissues. The algoritiun demonstrated an incremental improvement in 
sensitivity witiiout sacrificing specificity relative to tiie previously developed 
constituent algoritiim (1) tfiat employed normalized, mean-scaled spectra at 337 nm 
excitation only. Multivariate statistical analysis of normalized tissue spectra at aU 
tiiree excitation wavelengtiis. indicated tiiree principal components show statistically 
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significant differences between SJLs and nonnai squamous tissues (Table 2). These 
three principal components account coUecUvely ifor 65% of the total variance of the 
spectral data set. Logistic discrimination was used to develop a classification 
algorithm to discriminate between SE-s and nonnai squamous epithelia based on these 
5 three informative principal components. Prior probabUities were determined by 
calculating the percmtage of each tissue type from the data set: 62% normal 
squamous tissues and 38% SILs. The cost of misclassification of SJL was optimized at 
0.7. Posterior piobabilities of belonging to each tissue type were calculated for all 
samples from the data set, using the known prior probabilities, cost of 

10 misclassification of SJLs and the conditional joint probabilities calculated from the 
normal probability density function. HG. 28 iUustrates the retrospecUve accuracy of 
the algorithm applied to the calibration data set. The posterior probability of being 
classified into the STL category is plotted for all SILs and normal squamous epitheUa. 
HG. 28 indicates that 92% of HG SILs and 83% of LG SILs are correcUy classified 

15 with a posterior probability greater than 0.5. Ajpproximately 70% of colposcopically 
normal squamous epithelia are correctly classified, with a posterior probability less 
than 0.5. 

The confusion matrix in Table 3 compares the retrospective accuracy of the 
algorithm on the calibration data set to its prospective accuracy on the prediction set. 

20 In the confusion matrix, the fust row corresponds to the histo-pathologic classification 
and the fust column corresponds to the spectroscopic classification of the samples. A 
prospective evaluation of the algorithm's accuracy indicates that there is a small 
increase in the proportion of correctfy classified IjG SILs and no change in the 
proportion of coirectly classified HG SILs or normal squamous tissues. Note that the 

25 .majority of normal columnar tissues and samples with inflammation from both 
calibration and predicticm sets are misclassified as SIL using this algorithm. 
Evaluation of the misclassified SILs from the calibration set indicates that one sample 
(out of 19) with CIN ID, two samples (out of 16) with CIN n. two samples (out of 16) 
with CIN I and two samples (out of 7) with HPY are incorrectly classified. From the 

30 prediction set, two samples (out of 19) with CIN m, one samples (out of 1 6) with CIN 
n, two samples (out of 16) with CIN I and one sample (out of 8) with HPV are 
incorrectly classified as non SIL. 
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Constituent algorithm (2) which differentiates SILsfrom normal columnar tissues 

The greatest discrimination between SILs and normal columnar epithelia was 
achieved using a constituent algorithm based on normalized, mean-scaled spectra at 
5 all three excitation wavelengths. This algorithm demonstrated a substantially 
improved sensitivity for a similar specificity relative to the previously developed 
constituent algorithm (2) which used normalized, mean-scaled spectra at 380 nm 
excitation, only. Multivariate statistical analysis of a combination of nonnalized, 
mean-scaled tissue spectra at all three excitation wavelengths resulted in four 

10 principal components that demonstrate statistically significant differences between 
SILs and normal columnar epitheUa (Table 2). These four principal components 
collectively account for 80% of the total variance of the spectral data set. Logistic 
discrimination was employed to develop a classification algorithm to discriminate 
between SILs and normal columnar epithelia. The prior probabilities were determined 

15 to be: 28% normal columnar tissues and .72% SILs. The optimized cost of 
misclassification of SIL was equal to 0.58. Posterior probabilities of belonging to each 
tissue type were calculated for all samples from the dajta set. FIG. 29 illustrates the 
retrospective accuracy of the algorithm q>Rlied. to the calibration data set. The 
posterior probability of being classified into the SIL category is plotted for all SILs 

20 and normal columnar samples examined. FIG. 29 graphically indicates that 91% of 
HG SILs and 83% of LG SILs have a posterior probability that is greater than 0.5. 
Seventy-six percent of colposcopically normal -columnar epithelia are correctly 
classified with a posterior probability less than 0,5.. 

The confusion matrix in Table 4 compares the retrospective accuracy of the 

25 constituent algorithm on the calibration data set to its prospective accuracy on the 
prediction set The prospective accuracy of the algorithm (Table 4) indicates that there 
is a small increase in the proportion of correctly classified LG SILs and a small 
decrease in the proportion of correctly classified HG SILs; there is approximately a 
10% decrease in the proportion of correctly classified normal columnar tissues. Note 

30 that the majority of normal squamous tissues and samples with inflammation from 
both the calibration and prediction sets arc misclassificd as SIL using this algorithm. 
Evaluation of the misclassified SDLs from the calibration set indicates that three 
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samples (out of 16) with ON n. three samples (out of 16) with CIN I and one sample 
(out of 7) with HPV are incorrecUy classified. From the prediction set. two samples 
(out of 19) with CIN m, three samples (out of 16) with CIN H. and three samples (out 
of 16) with CIN I are incorrectly classified. 

Constituent algorithm (3) which d^erentiates HG SILs and LC SILs 

A combination of normalized spectra at all three recitation wavelengths 
significandy oihanced the accuracy of die previously developed constituent algoritfun 
(3) which differentiated HG SILs from LG SILs using nonnalized specua at 460 nm 
excitation. Multivariate statistical analysis of normalized spectra at all tiiree excitation 
wavelengtiis resulted in four statistically significant principal components, that 
account collectively for 67% of tiie total variance of tiie spectral data set (Table 2). 
Again, a probability based classification algoritiun was developed to differentiate HG 
SILs from LG SILs. The prior probability was: 40% LG SILs and 60% HG SILs. The 
optimal cost of misclassification of HG SIL was equal to .0.51. Posterior probabilities 
of belonging to each tissue type were calculated. . FIG. 30 illustrates die retrospective 
accuracy of tfie algoritfmi ^plied to die calibration data set. The. posterior probability 
of being classified into the HG SIL category is plotted for all SILs evaluated. Fig. 30 
indicates tiiat 83% of HG SILs have a posterior probability greater dian 0.5, and 70% 
of LG SILs have a posterior probability less tiian 0.5. 

The confusion matrix in Table 5 compares the retrospective accuracy of the 
constituent algorithm on the calibration set to its prospective accuracy on die 
prediction set. Its prospective accuracy indicates tiiat tiiere is a 5% decrease in die 
proportion of corrcctiy classified LG SBLs and no change in die proportion of conecdy 
classified HG SILs. From die calibration set, six HG SILs are misclassified; three 
samples (out 19) widi ON m and three samples (out of 16) widi CIN n are 
misclassified as LG SIL. The misclassified LG SILs comprise of five samples (out of 
16) wiUi CHN I and two samples (out of 7) with HPV. From the prediction set. five HG 
SILs are misclassified: two samples (out of 19) widi CIN III and duee (out of 16) widj 
CIN n. There were ten misclassified LG SILs from die prediction set: seven with CIN 
I (out of 16) and duee (out of 8) wiUi HPV. 
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"FulUparameter'' composite screening and diagnostic algorithms 

A composite screening algorithm was developed to differentiate SE-s and non 
SILs (normal squamous and columnar epithclia and inflammation) and a composite 
diagnostic algorithm was developed to differentiate HG SE-s from non HG SILs (LX3 
SE-s, normal epithelia and inflanmiadon). The effective accuracy of both composite 
algorithms were compared to those of the constituent algorithms from which they 
were developed and to the accuracy of current detection modalities. 

A composite screening algorithm which discriminates between SILs and non SILs 

A composite screening algorithm to differentiate SILs from non SILs was 
developed using a combination of the two constituent algorithms: algorithm (1) which 
differentiates SRs from normal squamous tissues and algorithm (2) which 
differentiates SILs from normal columnar epithelia. The optimal cost of 
miclassification of SIL was equal to 0.66 foi constituent algoritiim (1) and 0.64 for 
constituent algoritimi (2). Only the costs of misclassification of SIL of Uie two 
constituent algoritiims was altered for the development of the composite screening 
algorithm. These costs were selected to minimize the total number of misclassificd 
samples. 

The accuracy of the composite screening algorithm on the calibration and 
prediction data sets is illustrated in the confiision matrix in Table 6. Examination of 
the confusion matrix indicates that the algorithm correctly classifies approximately 
90% of HG SILs and 75% of LG SIL from tiife calibration data set. Furthermore, 
approximately, 80% of normal squamous tissues and 70% of normal columnar 
epitiiclia from the calibration set are conectiy classified. Evaluation of die prediction 
set indicates that tiiere is a small change in the proportion of correctly classified HG 
SILs and LG SILs. There is a negligible change in. the correct classification of normal 
squamous and columnar tissues. Note that while 80% of samples with inflammation 
from the calibration set are inconrectiy classified. as. Sm only 43% of tiiese samples 
from die prediction set are incorrecdy classified : . . 

A comparison of the accuracy of the composite screening algorithm (Table 6) 
to tiiat of each of the constituent algorithms (1) (Table 3) and (2) (Table 4) on die 
same spectral data set indicates that in general, tiiere is less tiian a 10% decrease in die 
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proportion of coirecUy classified SILs using the composite screening algorithm 
relative to using eiUier of the constituent algorithms independently. Note however that 
the proportion of concctly classified normal (squamous and columnar) cpithelia is 
substantially higher using the composite algorithm relative to using either of the 
constituent algorithms independently. These results confimi that utilization of a 
combination of die two consrituent algoritiuns, sigiiificantiy reduces the felse-positive 
rate relative to tiiat using each algoritimi independendy. Evaluation of die 
spectroscopicafly misclassified SILs from die calibration set (Table 6) indicates that 
only one sample (out of 19) widi ON m, three samples (oiit of 16) widi CIN n. two 
samples (out of 16) witii CIN I and four samples (out of 7) wiUi HPV are incorrecdy 
classified. From die prediction data set (Table 6). two samples (out of 19) with CIN 
m, four samples (out of 16) mth CIN II, diree samples (out of 16) widi CIN I and one 
sample (out of 8) widi HPV are incorrecdy classified. 

A composite diagnostic algonthm which differifntiites HG SlLs from non HC SILs 

A composite diagnostic algoridun which differentially detects HG SILs was 
developed using a combination of all diree constituent algotirhms: algoridim (1) 
which differentiates SILs from nomal squanious tissues, algoiidim (2) which 
differentiates SILs from normal columnar epidiclia and algoridun (3) which 
differentiates HG SILs from LG SBLs. The optimal costs of miclassification of Stt. 
was equal to 0.87 for algoridim (1) and 0.65 for algorithm (2); the optimal cost of 
misclassification of HG SIL was equal to 0.49 for algoridim (3). Only die costs of 
misclassification of SIL of constituent algorithms (1) and (2) and die cost of 
misclassification of HG SIL of constituent aigoiithm (3) were altered during 
development of die composite diagnostic algoridun. These costs were selected to 
minimize the total number of misclassified samples. 

The results of die composite diagnostic algoridun on die calibration and 
prediction sets are shown in die confusion matrix in Table 7. The algoridun correcUy 
classifies 80% of HG SILs, 74% of LG SILs and more dian 80% of normal epidieUa. 
Evaluation of die prediction set using diis composite algoridim indicates that diere is 
only a 3% decrease in die proportion of conecdy classified HG SILs and a 7% 
decrease in die proportion of correcdy classified LG SILs. There is less dian a 10% 
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decrease in the proportion of correctly classified nomal epithelia. A comparison 
between the calibration and prediction sets indicates that while more than 70% of 
samples with inflammation from the calibration data set are incorrectly classified as 
HG SIL. only 14% of samples with inflanunation from the prediction set are 
inconecUy identified. Due to the relatively small number of samples examined in this 
histo-paihologic category, the results presented here do not conclusively establish if 
the algorithm is capable of correctly identifying inflammation. 

A comparison of the accuracy of the composite diagnostic algorithm to that of 
constituent algorithm (3) which differentiates HG SBLs fitjm LG SlLs (Table 5) 
indicates there is less than a 5% decrease in the proportion of correctly classified HG 
SILs and a 5% increase in the proportion of correcUy classified LG SILs using the 
composite diagnostic algorithm relative to using the constituent algorithm (3). 
Evaluation of the HG SILs from the calibration set (Table 7) that were incorrecUy 
classified indicates that three samples (out of 19).\yith iQIN Dl and four samples (out 
of 16) with ON n are incorrecay classified. From the prediction set, four samples (out 
of 19) with CIN m and five samples (out of 16) with CIN H are incorrecUy classified. 
"Reduced-parameter" composite screening and diagnostic algorithms 

Component Loadings'. A component loading represents the correlation between each 
principal component and the original pre-processed fluorescence emission spectra at a 
particular excitation wavelength. FIG. 31(a-c) illustrate component loadings of the 
diagnostically relevant principal components of constituent algorithm (1) obtained 
from nonnaUzed spectra at 337, 380 and 460 nm excitation, respectively. HG. 32(a-c) 
display component loadings that correspond to the diagnostically relevant principal 
components of constituent algorithm (2) obtained from normalized, mean-scaled 
spectra at 337, 380 and 460 nm excitation, respectively. Finally. FIG. 33(a-c) display 
the component loadings corresponding to die diagnostically relevant principal 
components of constituent algorithm (3), obtained from normalized spectra at 337, 
380 and 460 nm excitation, respectively. La each graph shown, the abscissa 
corresponds to the emission wavelength range at a particular excitation wavelengtii 
and the ordinate corresponds to tht correlation coefficient of the component loading. 
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Conelaiion coefficients of the component loading above 0^ and below -0.5 are 
considered to be significant 

FIGS. 31(a), 32(a) and 33(a) display component loadings of principal 
components of constituent algorithms (1), (2) and (3). respectively, obtained from pre- 
processed spectra at 337 nm excitation. A closer examination indicates that 
component loading 1 is nearly identical for all three algorithms. Evaluation of this 
loading indicates that it is positively correlated with corresponding emission spectra 
over the wavelength range 360-440 nm and negatively correlated with corresponding 
emission spectra over the wavelength range; 460-660- nm. All remaining principal 
components of all three algorithms display a correlation . between -0.5 and 0.5. except 
component loading 4 of algorithm (2) (Fig;.. 32(a)) which displays a positive 
correlation of 0.75 with the corresponding emission spectra at 460 nm. 

FIGS. 31(b). 32(b) and 33(b) display component loadings that correspond to 
die diagnostically relevant principal components o( constituent algoritimis (1), (2) and 
(3), respectively obtained from pre-processed spectra at 380 nm excitation. 
Component loading 1 of all tiiree algoritiims is positively correlated witii 
corresponding emission spectra over the wayelengtii range. 400450 nm. Between 
500-600 nm. component loading 1 of algpriUmi (2) (Fig. 32(b)) is correlated 
negatively with corresponding emission spectra.; Eijamination of component loading 3 
of algorithm (1) (Fig. 31(b)) and algorithm (3) (Fig. 33(b)) indicates tiiat they are also 
negatively cmrelated wiUi corresponding emission spectra from 500-600 nm. Only 
component loading 2 of algorithm (2) (Fig.; 32(b)) is positively correlated with 
corresponding emission spectra from 500-600 nn>.; Also note tiiat component loading 
3 of algoritiim (1) (Fig. 31(b)) and component loadings 3 and 6 of algorithm (3) (Fig. 
33(b)) display a correlation with corresponding emission spectra at approximately 640 
nm. 

FIGS. 31(c), 32(c) and 33(c) display component loadmgs that correspond to 
tiie diagnostic principal components of constituent algoritiims (1), (2) and (3), 
respectively obtained ftom pre-processed spectra at 460 nm excitation. Note that only 
cooqwnent loading I displays a negative correlation (< -OJ) widi corresponding 
emission spectra for all three algorithms. This component loading is correlated witii 
corresponding emission spectra over tiie wavelength, range 580-660 nm. The 
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remaining principal components of all three algorithms display a coirelation between - 
0.S and 0.5. 

The component loadings at all three excitation wavelengths of aU three 
constituent algorithms were evaluated to select fluorescence intensities at a minimum 
number of excitation-emission wavelength pairs required for the previously developed 
constituent and composite algorithms to perfonn with a minimal decrease in 
classification accuracy. Portions of the component loadings of the three constituent 
algorithms most highly correlated (correlation > 0.5 or < -0.5) with corresponding 
emission spectra at each excitation wavelength were selected and the reduced data 
matrix was then used to regenerate and evaluate the constituent and composite 
algorithms, ft was iteratively detcimined that fluorescence intensities a: a minimum of 
15 excitation-emission wavelength pairs are required to re-develop constituent and 
composite algorithms that demonstrate a minimum decrease in classification accuracy. 
At 337 nm excitation, fluorescence intensities at two emission wavelengths between 
360-450 nm and intensities at two emission wavelengths between 460-660 nm were 
selected. At 380 nm exciution. intensities at two.emission wavelengths between 400- 
450 nm and intensities at four emission wayel?ngths between 500-640 nm were 
selected. Finally, at 460 nm excitation, fluorescence intensities at five emission 
wavelengths over the range 580-660 nm was selected. Table 8 lists these excitation- 
emission wavelength paks for each of the three coruf/menr algorithms. (1), (2) and 
(3). These excitation-emission wavelength pairs are also indicated on the component 
loading plots in Figs. 31-33. The bandwidth at each emission wavelength is 10 nm. 

Reduced-parameter composite algorithms 

Using the fluorescence intensities only at the selected excitation-emission 
wavelength pairs, the three constituent algorithms were re-developed using tiie same 
formal analytical process as was done previously using die entire fluorescence 
emission spectra at all three excitation wavelengths (Fig. 24). The Uirce constituent 
algorithms were then independenUy optimized using the calibration set and tested 
prospectively on the prediction data set. They were combined as described previously 
into composite screening and diagnostic algorithms. The effective accuracy of these 
reduced-parameter composite algorithms were compared to that of the full-parameter 
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composite algorithms developed previously using fluorescence emission spectra at all 
three excitation wavelengths. 

Table 9 displays the accuracy of the reduccd-paramctcr composite screening 
algorithm (based on fluorescence intensities at 15 excitation-emission wavelength 
pairs) which discriminates between SILs and non SILs applied to the calibradon and 
prediction sets. A comparison between the calibradon and prediction data sets 
mdicates that there is less than a 10% decrease in the proportion of correctly classified 
SILs and normal squamous tissues from the prediction set Note however that there is 
a 20% increase in the proportion of correcdy classified normal columnar epiUielia and 
a 40% increase in the proportion of correcdy classified samples with inflammation 
from the prediction set. 

The accuracy of the reduced-parameter composite screening algoridun (Table 
9) was compared to that of the full-parameter composite screening algoridim (Table 
6) applied to the same spectral data set A compari§on indicates diat in general there is 
less than a 10% decrease in the accuracy of the reduced-parameter composite 
algorithm relative to that of the full-parameter composite screening algorithm, except 
for a 20% decrease in the proportion of correctiy classified normal columnar epitiielia 
firom die calibration set tested using the reduced-parameter composite screening 
algoridmi (Table 9). 

Table 10 displays tiie accuracy of the reduced-parameter composite diagnostic 
algoridmi tfiat differentially identifies HG SILs from die calibration and prediction 
sets. A comparison of sample classification between die calibration and prediction 
data sets indicates tiiat there is negligible change in die proportion of correctiy 
classified HG SILs, LG SILs and normal squamous epidielia. Note diat there is 
approximately a 20% increase in die proportion of correcdy classified normal 
columnar epidielia and samples widi inflanunation from the prediction set. 

A comparison of the composite diagnostic algorithm based on the reduced 
emission variables (Table 10) to diat using fluorescence emission spectra at all diree 
excitation wavelengths (Table 7) applied to the same specu-al data set indicates diat in 
general, the accuracy of the reduced-parameter comp^jj/r^ diagnostic algorithm is 
widiin 10% of that reported for the full-parameter comp^^jif^ diagnostic algorithm; 
however, a comparison between Tables 7 and 10 indicates diat diere is approximately 
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a 15% decrease and a 20% increase in the proportion of correcUy classified normal 
columnar epithelia from the calibration and prediction sets (Table 10). respectively 
which were tested using the reduced-parameter composite diagnostic algorithm. The 
opposite trend is observed for samples with inflammation tested using the reduced- 
parameter composite diagnostic algorithm (Table 10). 

Table 1 1 compares the sensitivity and specificity of the full-parameter and 
reduced-parameter composite algorithms to that of Pap smear screening and 
colposcopy in expert hands. Table 11 indicates that the composite screening 
algorithms have a similar specificity and a significantly improved sensitivity relative 
to Pap smear screening. A comparison of the sensitivity of the composite screening 
algorithms to that of colposcopy in expert hands for differentiating SILs from non 
SILs indicates tiiat these algorithms demonstrate a 10% decrease in sensitivity, but a 
20% improvement in specificity. The composite diagnostic algorithms and colposcopy 
in expert hands discriminate HG SILs from, non HG SILs witii a very similar 
sensitivity and specificity. Also note tiiat the variability (standard deviation) of both 
Pap smear screening and colposcopy in expert hands is substantially higher than that 
of die full-parameter and reduced-parameter sqree^iing and diagnostic algorithms. A 
comparison between the full-parameter and reduced-parameter composite algorithms 
indicates that the algorithms based on die reduced emission variables demonstrate a 
minimal decrease in classification accuracy relative to those that employ fluorescence 
emission spectra at all three excitation wavelength^. 

Discussion and Conclusions 

Cervical tissue fluorescence spectra recorded at 337, 380 and 460 nm 
excitation can be used to develop composite screening and diagnostic algoritiims for 
the differential detection of SILs in vivo. The composite screening algorithm 
discriminates between SILs and non SILs with a similar specificity and a substantially 
improved sensitivity relative to standard P^ smear screening. When compared to 
colposcopy in expert hands, the composite screening algorithm displays a 10% 
decrease in sensitivity but almost a 20% improvement in specificity. A comparison 
between the composite diagnostic algorithm and colposcopy in the hands of expert 
practitioners indicates that both have a very similar sensitivity and specificity for 
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disoiminating between HG STLs and non HG STLs. Note that as spectroscopic 
intenogation of diseased and non-diseased cervical tissue sites in the current clinical 
study was directed by colposcopic impression, the sensitivity of the spectroscopic 
algoiitJins could not exceed the sensitivity of colposcopy. In other words, if there 
were histologically diseased cervical tissue sites that were overlooked by colposcopy, 
these false-negatives were not be evaluated spectroscopically. As a result, the 
potential of fluorescence spectroscopy to correctly classify these false-negatives could 
not be determined. 

The full-parameter composite algorithms were re-developed using 
fluorescence intensities at 15 excitaUon-emissioii wavelength pairs, to generate 
reduced-parameter composite algorithms. The fluorescence intensities at these reduced 
number of excitation-emission wavelength pairs were selected using a parameter 
called the component loading calculated from the principal components. Evaluation of 
the reduced-parameter composite algorithms indicates that they display a minimal 
decrease in sensitivity and specificity relative, to the fuU-parameter composite 
algorithms. The reduction in the number of excitation-emission wavelength pairs from 
161 to 15 implies reduction in the complexity and cost of the portable fluorimeter 
which would be used to measure cervical tissue .fluorescence. For example, if 
fluorescence intensities at only 15 excitation^emission wavelength pairs need to be 
measured, the polychromator and intensified ^ . diode , array can be replaced by a 
mechanical filter assembly and a single channel detector. This represents a substantial 
decrease in cost and complexity of this instrumentation at the expense of less than a 
1 % decrease in sensitivity. 

Several significant improvements and refinements have been made in 
previously developed constituent algorithms using tissue spectra at ail three excitation 
wavelengths. Previously, the constituent algorithm (1) which differcnUates SBLs from 
normal squamous epithelia was developed using normalized, mean-scaled spectra at a 
single excitation wavelength: 337 nm. Specffa at this excitation wavelength had to be 
mean-scaled in order to calibrate for the significant inter-patient variation in spectral 
line shape. This algorithm demonstrates the greatest classification accuracy when the 
patient being evaluated has equal numbers of diseased and non-diseased tissue sites. 
This restriction dearly reduces the clitiical effectiveness of this algorithm. The new 



wo 99/57529 PCTAJS99/09768 

64 

algorithm which is based on normalized emission spectra at all three excitation 
wavelengths, minimizes this inter-patient variation and hence obviates the need for 
mean-scaling, while maintaining a slightly improved classification accuracy. Inclusion 
of spectra at additional excitation wavelengths represents a significant improvement in 
the clinical effectiveness of this algorithm as it can be applied to a much wider 
population of patients. 

The accuracy of previously developed consHtuent algorithm (2) which 
discriminates between SDLs and normal columnar epithelia was significandy improved 
by using normalized, mean-scaled spectra at all three excitation wavelengths rather 
than at a single excitation wavelength. Despite the significant improvement in these 
results, this algorithm is also based on tissue specu^ that require mean-scaling at each 
excitation wavelength. A multivariate statistical algorithm based on normalized 
spectra only, at all three excitation wavelengths differentiates SELs from normal 
columnar epithelia with a significantly poorer ,seii$itivity than the algorithm that uses 
normalized, mean-scaled specu^ at all three excitation wavelengths. Therefore, mean- 
scaling is essential for the optimal operation of this algorithm. 

FmaUy. an improvement that is significant is the development of the third 
constituent algorithm which discriminates between LG SILs and HG SILs using tissue 
spectra at all three excitation wavelengths. The utilization of spectra at all three 
excitation wavelengths results in a substantial improvement in sensitivity relative to 
using the constituent algorithm (3) which is based pn a single excitation wavelengtii. 
Furthermore, spectra required for this algorithm do not have to be mean-scaled for 
inter-patient variation in spectral line shape. 

Each of the three constituent algorithms developed using spectral data from 
die current clinical study discriminate between a specific pair of tissue types. Using 
t3ch constituent algorithm, a posterior probability assignment of an unknown sample 
to a particular tissue category is calculated using a set of diagnostically relevant 
principal components diat demonstrate statistically significant differences between the 
two tissue types under consideration. The posterior probability output of the 
constituent algoritiuns are then combined to develop composite screening and 
diagnostic algoritiuns tiiat discriminate between many of the clinically relevant tissues 
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types. Hence, development of the two composite algorithms is based on the pior 
development of the three constituent algorithms. 

To test the feasibility of an alternate approach, the two composite algorithms 
were developed directly ftom diagnostically relevant principal components of their 
corresponding constituent algorithms, thereby by-passing the constituent algoridmi 
development phase. The composite screening algorithm which discriminates between 
SRs and non SJLs was developed using logistic discrimination based on the 
diagnosucally relevant principal components of constituent algoridims (1) and (2); the 
posterior probability of an tmknown sample being classified as either SE. or non SIL 
was calculated. The composite diagnostic algorithm which discriminates between HG 
SILs and non HG SILs was developed using logistic discrimination based on the 
diagnostically relevant principal components of constituent algorithms (1), (2) and (3); 
the posterior probability of an unknown sample being classified as either HG SIL or 
non HG SE. was calculated. The composite algorithms developed directly from the 
diagnosucally relevant principal components of tjieir corresponding constituent 
algorithms demonstrated a poorer classification, accuracy relative to composite 
algorithms that were developed using a combination of corresponding constituent 
algorithms. Therefore, composite screening and diagnostic algorithms were developed 
using a combination of independoiUy developed constituent algorithms. 

Pre-processing to remove inter-patient and intra-patient variation prior to the 
development of the multivariate statistical algorithm may remove the spectral 
variations that may be significant from a biological standpoint. However, in the 
development of multivariate statistical screening and diagnostic algorithms that can 
successfully identify disease in any given patient, the intra-patient and inter-patient 
spectral variations must be removed if they do obscure the important inter-category 
differences that the algcnithm needs to extract. If a sophisdcated physical model can 
be developed to describe the biological basis of the spectral data as well as the inter- 
padent and intra-patient spectral variations accurately, then this information can be 
used to develop bener methods of pre-processing or direct the need for additional 
measurements to calibrate for these variations. This is an important issue to address 
and is currently the subject of study in our laboratory. 
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In spite of the successful development of algorithms that can differentiate (1) 
SILs from nonnal tissues and (2) HG SILs from non HG SILs and normal cpithelia, 
these algorithms do not consistently classify samples with inflammation as non SIL; 
this results in a decrease in their specificity. Although the number of samples 
examined in this histo-pathologic category is limited, analysis from previous and 
current clinical smdies indicates that it relatively difficult to coirectly classify these 
samples. A plausible explanation for this is that (1) the current excitation wavelengths 
used may not be optimum for identification of fluorophores that arc unique to 
inflammation and/or (2) the penetration depth of the light may not be sufficiendy long 
to spectroscopically interrogate the underlying stromal layers where inflammation 
develops. 

The specificity of fluorescence spectroscopy for the detection of cervical 
neoplasia may be improved by using fluorescent photosensitizcrs to enhance the 
contrast between neoplastic and non-neoplastic tissues in vivo. The use of 
photosensitizcrs such as photofrin, hematoporphyrin . derivative or 5-ALA may 
potentially enhance the spectroscopic differences between neoplastic and non- 
neoplastic (normal and inflanunatory) cervical tissues and hence contribute to an 
improved specificity of the spectroscopic algorithms. 

Another limitation is that the portable fluprimeter described in this Example to 
measure in vivo tissue fluorescence spectra utilizes a single-pixel probe that 
interrogates a 1 mm diameter area on the cervix. Although the single-pixel probe that 
the inventors have used provides the capability to determine whether a small region of 
cenrical tissue contains pre-cancerous changes, mapping the entire cervix with this 
system is extremely time consuming, making wide-scale application of this 
technology impractical. To address this limitation, a multi-pixel probe that can be 
used to acquire fluorescence spectra from multiple sites on the cervix, simultaneously 
may be used* This may provide to a user not only information regarding the presence 
of pre-cancer but can also indicate its location and extent. 

In summary, in vivo fluorescence spectroscopy has the capabUity to 
significantly improve the sensitivity of Pap smear screening and the specificity of 
colposcopy in expert hands. Hence, this technique may play an important clinical role 
as a screening / re-screening tool (to screen women who have already had an initial 
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positive Pap smear, but who have not undeigone colposcopy and directed biopsy) and 
as an adjunct to colposcopy in expert hands. Advantages realized by using this 
technique include, but are not limited to: (1) screening and diagnostic information 
may be obtained in near real-time and (2) this technique may be easily automated 
hence reducing the need for subjective interpretation. Furthermore, while the Pap 
smear examines only exfoliated cervical epitiielial cells, fluorescence specttoscopy 
may interrogate the full thickness of the epitiielitim. 
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EXAMPLE 3 

Head and Neck Anafysis- Fluorescence 

Analysis of fluorescence data collected in a clinical head and neck study has 
been analyzed in accordance with the present disclosure. The Example that follows 
describes analysis of these data. 

Materials and Methods 

Fluorescence excitation emission matrices were measured in vivo from sixty 
two sites in 9 normal volunteers and 11 patients with a known or suspected 
prcmalignant or malignant oral cavity lesion. Excitation wavelength ranged from 330 
to 500 nm and emission wavelength ranged from 340 to 600 nm. Fluorescence data 
were analyzed to determine which excitation and emission wavelengths contained the 
most diagnostically useftil information and to estimate the performance of diagnostic 
algorithms based on this information. Algorithms were developed based on 
combinations of emission spectra at various excitation wavelengths in order to 
determine which excitation wavelengths contained the most diagnostic information. 
Then, at those excitation wavelengths, algorithms were developed based on reduced 
numbers of emission wavelengths to determine whether complete emission spectra 
were required or whether accurate diagnosis could be made using multi-spectral 
measurements at a few excitation/emission wavelength combinations. The algorithm 
development process, consisted of the following steps: (1) data pre-processing to 
reduce inter-patient variations, (2) data reduction to reduce the dimensionality of the 
data set, (3) feature selection and classification to develop algoritiims which maximize 
diagnostic performance and minimized the likelihood of over-training in a training set, 
(4) unbiased evaluation of these algorithms using the technique of cross-validation. 
Results 

The optimal excitation wavelengtfis for the in vivo detection of oral cancers 
with fluorescence spectroscopy were found to be 350, 380 and 400 nm. An unbiased 
estimate of an algoridmi based on the entire emission spectra at these excitation 
wavelengths yields a sensitivity of 100% and specificity of 88%. Increasing Uie 
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number of excitation wavelengths did not improve algorithm perfonnance. Better 
algorithm performance was obtained when data were normalized to the peak emission 
intensity of the concatenated vector than when each emission spectrum was 
normalized to its own peak emission wavelength. The number of emission 
wavelengths could be significanUy reduced without compromising algorithm 
performance. When only a single emission wavelength of 472 nm, common to aU 
three excitation wavelengths, was used algorithm performance on cross vaUdation was 
90% sensitivity and 88% specificity. The unbiased perfonnance estimate for the 
diagnostic algorithms based on fluorescence spectroscopy have a higher sensitivity 
than current visual screening techniques done by experts. 

Study Subjects 

9 normal volunteers and 11 patients with a known or suspected premalignant 
or malignant oral cavity lesion were recruited to participate in the smdy at the Head 
and Neck Surgery Clinical at Tbt University of T6xas M.b; Anderson Cancer Center. 
Written informed consent was obtained from each person in the smdy. 
Instrument 

A FastEEM system in accordance with die present disclosure was used for this 
stody. Briefly, the system measured fluorescence emission specua at 18 excitation 
wavelengths, ranging from 330 nm to 500 nm in 10 nm increments. The system 
incorporated a flberoptic probe, a Xenon arc lamp coupled to a monochromator to 
provide excitation light and a polychromator and thermo-electrically cooled CCD 
camera to record fluorescence intensity as a funcUon of emission wavelength. 
. Calibration 

A background EEM. to be subtracted from the acquired patient data, was 
obtained with the probe immersed in a non-fluorescent botUe filled with distilled 
water at the beginning of each measurement day. Then a fluorescence EEM was 
measured widi the probe placed on the surface , of a quartz cuvette containing a 
soluUon of Rhodamine 610 (Exciton. Dayton. OH) dissolved in ethylene glycol (2 
mg/mL). 
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. To correci for the non-unifonn spectral response of the detection system, the 
spectra of two caUbraied sources were measured; in the visible an NIST traceable 
calibrated tungsten ribbon filament lamp was used and in the UV a deuterium lamp 
was used (SSOC and 45D. Optronic Laboratories Inc. Orlando, FL). Correcuon factors 
were derived from these spectra. Background subtracted EEMs from patients were 
then corrected for the non-uniform spectral response of the detection system. 
Variations in the intensity of the fluorBscence excitation light source at different 
excitation wavelengths were corrected using measurements of the intensity at each 
excitation wavelength at the probe tip made using a calibrated photodiode (818-UV, 
Newport Research Corp.). Finally, corrected fluorescence intensities from each site 
were divided by the fluorescence emission intensity of the Rhodamine standard at 460 
nm excitation. 580 nm emission. Thus, data illustrated in this paper are not the 
absolute fluorescence intensities of tissue but rather the intensities relative to the 
Rhodamine standard. : .'-j v 

Data Aquisition 

Before the probe was used it was disinfected with Metricide (Metrex Research 
Corp.) in accordance with standard protocol, the probe was then guided into the oral 
cavity and its tip positioned flush with the mucosa. Then fluorescence EEMs were 
measured. 

Ruorescence EEMs were measured from 9 volunteers with no histoiy of oral 
cavity neoplasia at 35 clinically normal sites in the oral cavity (table 1). No biopsies 
were obtained from volunteers. Following visual screening in 11 patients with a 
known or suspected premalignant or malignant oral cavity lesion, fluorescence EEMs 
were measured from 27 sites (Table 1). The physician placed the Hber optic probe on 
a lesion or suspected lesion and the fluorescence of that site was measured. In 
addition to the three to five visually abnormal sites, fluorescence EEMs were 
measured from one to three contralateral normal sites. Post-spectroscopy, abnormal 
sites were tattooed with India Ink where the probe measured the spectra. A clinical 
diagnosis of each lesion as normal, abnormal (not dysplastic), abnormal (dysplastic) 
Qt cancerous was recorded by an experienced head and neck surgeon (AMG) or dental 
oncologist (RJ). During follow up surgery, a 2-4 mm biopsy of the tissue was taken 
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from the tattooed area. These specimens were evaluated by an experienced 
pathologist (BK) using light microscopy and classified as nonnal. mucosal reactive 
atypia (MRA), dysplasia or cancer using standard diagnostic criterion. Biopsies with 
multiple diagnoses were classified according to the most severe pathological 
diagnosis. The pathologist and clinicians ^yere bUnded to the results of the 
spectroscopic analyses. 

DataReview 

A total of 88 sites were measured from 26 subjects. AU spectra were reviewed 
by a single investigator blinded to the pathologic results (DIM). Spectra were 
discarded if files were not saved properly due to software eiror (8 sites), instiument 
error (2 sites), operator error (4 sites), probe movement (3 sites), and the presence of 
room light artifacts at wavelengths below 600 nm (3 sites) in at least one of the 
emission spectra. From the remaining sites, spectra from six sites were excluded 
because the tattoo could not be located and consequently rcUable histologic diagnosis 
was not avaUable for these sites. Therefore, fluorescence EEMs from 62 sites from 20 
subjects were available for further analysis (Table 1). 

Data Analysis 

Euorescence data were analyzed to determine which excitation and emission 
wavelengths contained Uic most diagnostically useful information and to estimate (be 
performance of diagnostic algoritiuns based on this information. Algorithms based on 
multi-variate discriminant analysis were considered: Algorithms based on 
combinations of emission spectra at various excitation wavelengths were developed in 
order to determine which excitation wavelengths contained the most diagnostic 
information. Then, at those excitation wavelengths, spectra based on reduced 
numbcR of emission wavelengths were developed to determine whether complete 
emission spectra were required or whether accurate diagnosis could be made using 
multi-spectral measurements at a few excitation/emission wavelcngtii combinations. 

In each case, tiie algorithm development process, described in detail below, 
included tire foUowing major steps: (I) data pre-processing to reduce inter-patient 
variations, (2) dau reduction to reduce the dimensionality of tiie data set, (3) feature 
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selection and classification to develop algorithms which maximized diagnostic 
performance and minimized the likelihood of over-training in a training set, (4) 
unbiased evaluation of these algorithms using the technique of cross-validation. 
Diagnostic Categories 

Multi-variaie discriminant algorithms were sought to separate two tissue 
categories: nonnal and abnormal. The abnormal class contained sites with dysplasia, 
carcinoma in situ and squamous cell carcinoma; the normal class contained sites 
which were clmically and/or histologically normal as well as benign changes such as 
inflanunation. 

Data Pre-processing 

Fluorescence data from a single measurement site is represented as a matrix 
containing calibrated fluorescence intensity as a function of excitation and emission 
wavelength. Columns of this matrix correspond lo emission spectra at a particular 
excitation wavelength; rows of this mamx correspondi to- excitation spectra at a 
particular emission wavelength. Each excitation spectrum contains 18 intensity 
measurements; each emission spectrum contains between 50 and 130 intensity 
measurements depending on the excitation wavelength- Most multi-variate dau 
analysis techniques require vector input rather than matrix input, so the column 
vectors coniaming the emission speco^ at excitation wavelengths selected for 
evaluation were concatenated into a single vector in order to explore which excitation 
wavelengths contained the most diagnostic information 

Our previous work illusu-ated that spccu-a of oral cavity obtained in vivo show 
large patient to patient variations in intensity that can be greater than the inter- 
category differences. Therefore, the inventors explored pre-processing methods to 
reduce the inter-patient variations, while preserving inter-category differences. While 
many different methods of pre-processing are possible, two methods were selected for 
evaluation here: (1) normalization of all emission spectra of a given excitation 
wavelength combination to the maximum intensity contained within that combination, 
and (2) normalization of each emission specura to its maximum intensity. 
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Reduction of Excitation Wavelength Number 

In this study, fluorescence emission spectra were measured at 18 different 
excitation wavelengths. One goal of data analysis was to detennine which 
combination of excitation wavelengths contains the most diagnostic infonnaUon. The 
inventors considered combinations of up to foiir emission spectra. Limiting the 
number of wavelengths to taar allows for construction of a reasonably cost-effective 
clinical spectroscopy system. Two strategies were considered to identify the optimal 
wavelength combination. The first was to identify tiie single wavelengtii which gives 
die best diagnostic performance, then the wavelength of those remaining that most 
improves diagnostic performance, and so forth until performance no longer improves 
or four wavelengths have been selected. The second method was to evaluate aU 
possible combinations of up to four wavelengths chosen from tiie 18 possible 
excitation wavelengtiis. This equates to 18 combinations of one, 153 combinations of 
two, 816 combinations of tiiiee, and 3.060 combinations of four excitation 
wavelengtiis, for a total of 4,047 combinations, ...While the first metiiod requires less 
computational time, it is only appropriate for normalization methods tiiat remove 
relative intensity information. OUierwise, tiie bwt single wavelengtii may not be part 
of tiie best wavelengtii pair that exploits differences in relative intensity. The second 
method can be used witii eitiier normalization scheme and in addition, provides a tool 
to rank Uie top wavelengtii combinations, ratiier tiian identifying tiie single best 
wavelength combination, so this metiiod was punued. 

Algorithm Development 

For each of tiie 4,047 combinations of one to four excitation wavelengths, 
spectra from tiie entire data set were used as a training set to develop multi-vaiiate 
algorithms to separate normal and abnormal tissues based on tiieir fluorescence 
emission spectra at aU possible wavelengtii combinations. Algoritiim development 
included of tiiree steps: (1) pre-processing, (2) data reduction and (3) development of 
a classification algoritiim which maximized diagnostic performance. Data were pre- 
processed using tiie two normalization schemes described above. For each 
normalization, principal component analysis was performed using the entire dataset 
and eigenvectors accounting for 65. 75. 85. and 95% of tiie total variance were 
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reiained. Principal component scores associated with these eigenvectors were 
calculated for each sample. DUcriminant functions were then formed to classify each 
sample as normal or abnormal. The classification was based on the Mahaianobis 
dUtanci. which is a multivariate measure of the reparation of a point from a dataset in 
n-dimensional space. Each sample was held out one at a time and the Mahaianobis 
distances between to the held out sample and the remaining normal and abnormal 
samples were calculated; the sample was classified according to the category 
corresponding to the smallest distance. The .sensitivity and specificity of the 
algorithm were then evaluated relative to diagnoses based on histopathology (in 
patients suspected to have oral cavity maUgnancy) or clinical impression (in normal 
volunteers). Overall diagnostic performance was evaluated as the sum of the 
sensitivity and tht specificity, Urns minimizing the number of misclassifications 
(when prevalence of disease and normal are approximately equal). The performance 
of tiie diagnostic algoridmi depended on tiie priacipal component scores which were 
included. Four different diagnostic algoritiuns. were developed using principal 
componem scores derived from eigenvectors accounting for increasing amounts of 
total variance. From tiie available pool of principle component scores, the single 
principal component score yielding the best initial performance was identified, and 
Uien die principal component score diat most improved Uus performance was selected. 
This process was repeated until performance is no longer improved by tiie addition of 
principal components scores, or all available scores were selected. The pool of 
available eigenvectors is specified by a variance criterion, eigenvector significance 
level (ESL), that represents the minimum variance fraction accounted for by the sum 
of the n largest eigenvalues. In Uiis work tiie inventors examined 4 ESLs, 
corresponding to 65%, 75%, 85% and 95% of tiie total variance. 

Comparing Performance of Various Excitation Wavelength Combinations 

At each ESL, tiie wavelengtii combinations were ranked in order of decreasing 
perforaiance, based upon die sum of sensitivity and specificity. TTie combinations 
were ranked and evaluated based upon training performance. However, as die ESL 
approaches 100%, over-training becomes more likely, since the available pool of 
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eigenvectors will account for nearly 100% of the variance, including variance due to 
noise. The magnitude of diagnostically important variances is unknown. 

The risk of over-training risk was assessed at the top 25 wavelength 
combinations of two, three, and four excitation wavelengths, by comparing the 
training set perfonnance to the performance of an algoiithm developed from the same 
data after the diagnoses corresponding to. each measurement site had been 
randomized. This provides a dataset with the same variance stmcnire as the original 
dataset, but where the diagnostic performance is not expected to exceed that of 
chance. In order to make equivalent comparisons, the disease pievalence in the real 
sample was maintained in the randomly assigned diagnoses. Diagnostic algorithms 
were then developed again which minimized the number of misclassified samples at a 
specified eigenvector significance level (ESL). Random diagnoses were assigned fifty 
times for each wavelength combination and the average and standard deviation of the 
sum of the sensitivity and specificity were calculated. Ideally, for completely 
normally distributed data, the sum of the sensitivity and specificity should be one for 
the randomized diagnosis at all levels of training significance. However, if over- 
training occurs, this sum will be greater than. one.. The top 25 wavelength 
combinations were then ranked again based on the absolute difference between the 
training set performance and random diagnosis assignment. This method allows the 
top wavelength combinations to be ranked in order of their robusmess, or lack of 
propensity to over-train. For a given numbo; of wavelengths per combination, the 
differences were ranked across aU four eigenvector significance levels. The largest 
difference, usually seen at ESL values of 65%. was selected as the optimal wavelengUi 
combination. This criterion selects tiie wavelength combination tiiat is least prone to 
over-Q^ining. 

Validation of Algorithm Performance 

Although the optimal wavclengdi combination has been identified based upon 
comparison of its performance to Uiat which can be achieved when the tissue 
diagnoses have been randomized, our estimates of algorithm performance are still 
biased since Uiey are based on the same training set used to develop tiie algorithm. An 
unbiased performance estimate must be made to assess the true potential of this 
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wavelength combination. The effects of over-training in perfonnance estimation can 
be minimized by using separate training and validations sets, or by using the method 
of cross-validation. The data set here was not sufficienUy large to divide into separate 
training and validation sets, therefore the inventors used the cross-validation metiiod. 
In this metiiod, all data from one patient are temporarily removed from tiie data set, 
the algorithm is developed using tiie remaining data set, and Uien die new algoritimi is 
^plied to tiie left out sites. This is repeated until data from each patient has been left 
out once. Cross validation was used to provide an unbiased estimate of the 
performance of rht top Uuee combinations of excitation wavelengtiK w'nh each 
normalization. 

Reduction of Emission Wavelength Number 

The inventors investigated wheUier effective diagnostic algorithms could be 
developed using reduced numbers of emission wavelengths at tiie top performing 
excitation wavelengdi combinations. The inventors caicuiated the component 
loadings associated witii tiie eigenvectors corresponding to tiie principal component 
scores selected in these algorithms. A component loading represents tiie correlation 
between each principal component and tiie original pre-processed fluorescence 
emission spectra at each excitation wavelengtii: The component loadings at each 
excitation wavelength were evaluated to select fluorescence intensities at a minimum 
number of excitation-emission wavelengtii pairs required for the algorithms to 
perfoim witii a mmimal decrease in classification accuracy. Portions of tiie 
component loadings most highly correlated (correlation >0.5 or <-0.5) wiUi 
corresponding emission spectra at each excitation wavelengtii were selected and tiie 
reduced data matrix was dien used to regenerate and evaluate tiie algoritiims. 
Results 

Ruorescence EEMs from 62 sites from 20 subjects were available for fimher 
analysis (Table 1). Of tiiese 62 sites, 37 were measured from tiie tongue, eight from 
tiie floor of moutii (FOM), seven from tiie buccal mucosa, four from tiie gingiva, one 
from tiie palate, and five from the lip. There were 52 normal, four dysplastic, and six 
cancerous sites. The data set consisted of two types of normal sites: adjacent normals 
and normals from a population witiiout oral cancer. Adjacent normals are tiie visually 
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normal sites taken from paUents that have suspected lesions elsewhere in the oral 
cavity. In this data set there were 17 adjacent normal (histologically normal) sites 
from eleven patients, and 35 visually normal sites taken from nine patients. 

The visual screening accuracy of the head and neck physicians for this data set 
was 100% sensiUvity and 83% specificity. This performance was determined by 
comparing the visual impressions of the clinicians to the histologic findinp upon 
excision. Results of the analysis of the spectroscopic data are presented according to 
the normalization method used. 

Normalization by peak emission intensity of the concatenated vector 

The top 25 combinations of one to four excitation wavelengths were ranked in 
order of the largest difference in the sum of the sensitivity and the specificity in the 
training set and the average performance with randomly assigned diagnoses. The top 
3 combinations correspond to the following excitation wavelength combinations: (350 
380 400 480). (350 380 400 490). and (350 380 400). All of these combinations 
demonstrate approximately the same training set performance, with 100% sensitivity 
and 90% specificity. These combinations have three wavelengths in common. Since 
no performance benefit was observed when a fourth wavelengUi was added for die top 
performing combinations, combinations of four wavclengtiis were not pursued any 
further. The top 25 combinations of Uuee excitation wavelengths, ranked in order of 
Uie largest difference in tiie sum of Uie sensitivity and the specificity in Uie training set 
and Uie average performance with randomly assigned diagnoses are given in Table 2. 
The ranking of each combination based upon trainirig set performance is given as 
weU. Table 2 gives die diagnostic performance, of each combination for botii tiie 
training set and die average performance for die data set widi randomized diagnosis. 
The random diagnosis performance demonsu-ated diat the combinations showed 
varying propensities to over-train. 

A histogram depicting die frequency at which each wavelengdi appeared in die 
top 25 combinations from Table 2 is shown in Figure 34 for various ESLs. At low 
ESL values of 65%, 75% and 85% die diagnostic importance of excitation at 350, 
380, and 400 nm is evident. This is seen in die histograms for wavelengdi 
combinations of two and four as well (data not shown). 
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To provide an unbiased estimate of perfonnance of these aJgoriihms. the 
diagnostic performance of the lop wavelength combinations was evaluated by using 
the method of cross-validation using the full dau set. The wavelength combinaUon 
(350, 380, 400 nm) demonstrated a cross validation performance of 100% sensitivity 
and 88% specificity. The other two combinations (350, 380, 400, 480 nm) (350, 380, 
400. 490 nm) demonstrated identical perfonnance upon cross validation with a 
sensitivity of 100% and a specifici^r of 90%. 

The emission spectra corresponding to all 62 sites at the three excitadon 
wavelengths common to these combinations are shown in Fig. 36. Visual 
examinaUon of Fig. 36 confirms the diagnostic potential of this wavelength 
combination. The identified combinations demonstrate the importance of the relative 
intensities as seen following normalization to tiie maximum intensity in the 
concatenated emission vector. Witii tiiis normalization, tiie normal sites demonstrate 
greater fluorescence intensity at 380 nm excitation. .450 nm emission tiian the 
abnormal sites. AdditionaUy. the remaining emission peaks tend to be more intense in 
normal sites than for abnormal sites in most instances. The normal sites misclassified 
as abnormal are easily seen in Figure 36. HistplpgicaUy. these sites demonstrated 
increased vascularity, suggesting that tiie increased hemoglobin absorption is one 
cause of the reduced relative fluorescence intensity from these sites. 

The algorithm based on die combination of 350. 380 and 400 nm excitation 
wavelengths selected only a single principal component score, associated witii the 
eigenvector that accounted for most of the total variance, Figure 37 shows tiiis 
eigenvector and tiie associated component loading. The eigenvector depicts tiie 
general lineshape of tiie normalized spectra shown in Figure 37. The component 
loading shows diat tiie principal component score for tiiis eigenvector is highly 
correlated to q)proximate]y four regions of die concatenated emission vector. Single 
emission intensities within Uiese ranges were selected arbitrarily and are denoted as 
solid green circles in Figure 37. These points correspond to tiie emission intensities of 
418 and 470 nm at 350 nm excitation, 448 nm emission at 380 nm excitation, and 
502 nm emission at 400 nm excitation. An algoritiim was developed using tiie same 
data reduction and classification metiiods as above based upon tiiis reduced data set. 
The training performance of tiie reduced algorithm is 100% sensitivity and 90% 
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specificity, and the cross-validated perfonnance is 90% sensitivity and 90% specificity 
compared to 100% sensitivity and 88% specificity of for the algoriUim based on the 
entire emission spectra. This algorithm uses a higher ESL of 95% since the reduced 
data set contains less variance introduced by noise. Motivated by tiie desire to 
construct a simple device tiiat could interrogate or image large areas of tissue, a 
reduced algoritfim based upon a single emission wavelengtii was evaluated. The 
emission wavelength chosen was common to all three emission spectra, 472 nm. The 
training performance of this reduced algorithm was 100% sensitivity. 88% specificity, 
and upon cross validation it was 90% sensitivity, and 88% specificity. 

Normalization of each emission spectra by its peak emission prior to concatenation 

The analysis was repeated using concatenated vectors in which each emission 
spectrum was normalized to its peak intensity. This method removes relative intensity 
information and relies on differences in fluorescence lineshape. The maximum 
difference between training performance and the performance after random diagnosis 
assignment was 0.58 compared to 0.82 using the other normalization metiiod. 
Consequentiy, Uie top wavelengtii combination identified (350, 380, 400, 430 nm) 
showed poor performance upon cross-validation witii a sensitivity of 50% and a 
specificity of 88%. It is interesting to note tiiat the pi«yiously identified wavelengUis, 
(350, 380. 400 nm) are also a part of tins combination, indicating that tiie line shape at 
these wavelengths contains diagnostic information. 

Discussion and Conclusions 

This Example identified tiie optimal excitation wavelengtiis for in vivo 
detection of oral cancers wiUi fluorescence spectroscopy. The optimal excitation 
wavelengtiis were found to be 350, 380 and 400 nm. An unbiased estimate of an 
algoritiim based on the entire emission spectra at tiiese excitation wavelengtiis yields a 
sensitivity of 100% and specificity of 88%. Increasing tiie number of exciution 
wavelengtiis did not improve algoritiim performance. Better algoritiim performance 
was obtained when data were normalized to tiie peak emission intensity of the 
concatenated vector tiian when each emission spectrum was normalized to its own 
peak emission wavelengtii. The discriminating ability of ibis wavelengtii combination 
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is due to differences in both relaUve intensity and spectral line shape. The number of 
eniission wavelengths could be significantly reduced as well without compromising 
algorithm performance. An algorithm based on four emission intensities: 41 8 and 470 
nm at 350 nm excitation, 448 nm emission at 380 mn excitation, and 502 nm emission 
5 at 400nm excitation yielded 90% sensitivity and 90% specificity upon cross- 
validation. When only a single emission wavelength of 472 nm, common to all three 
excitation wavelengths, was used algorithm performance on cross validation was 90% 
sensitivity and 88% specificity. 

The unbiased performance estimate for the diagnostic algoridmis based on 
10 fluorescence spectroscopy have a higher sensitivity than current visual screening 
techniques done by experts. In their hands, visual screening has been reported to have 
a sensitivity of 74% and specificity of 99%. The performance of visual screening by 
experts in this study was 100% sensitivity, 83% specificity. 

It is interesting to note that eniission spedra obtained at 400 nm «ccitation are 
15 included in a majority of the top combinations. Hemoglobin has a strong absorption 
maximum near this location, suggesting tiiat differences in absorption due to perfusion 
may offer diagnostic mformation. This suggests that the combinations of reflectance 
and fluorescence spectroscopy may offer improved.diagnostic performance. 

Head and Neck Analysis- Reflectance 

20 A FastEEM system was also used to measure tissue reflectance spectra over 

the visible region of the spectram at three source-detector fiber separations. The 
inventors have analyzed tiiese data with at least two goals: (1) to determine the 
diagnostic potential of reflectance spectroscopy for detection of neoplasia of the oral 
cavity, and (2) to determine die combmed diagnostic potential of fluorescence and 

25 reflectance spectroscopy for detection of neoplasia of the oral cavity. 

Study Design 

9 normal volunteers and 1 1 patients with a known or suspected premalignant 
or malignant oral cavity lesion were recruited to participate in the smdy at the Head 
and Neck Surgery Qinical at The University of Texas M.D. Anderson Cancer Center. 
30 Written informed consent was obtained from each person in the study. 
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Instrument 

The spectroscopic system used to measure reflectance spectra has been 
described in detail previously and is briefly summarized here. It includes of a Xenon 
arc lamp and a 295 nm long-pass filter which provides broadband illumination, a fiber 
optic probe which directs light to the tissue and collects diffusely reflected Hght from 
three locations (position 1, position 2, position 3), and an imaging spectrograph and 
CCD which detects the reflected light intensity as a function of wavelength. Fibers for 
illumination and coUection of diffuse refleaance are arranged in a ring at the edge of 
the probe. The coUection fibers are located 1.1. 2.1 and 3 mm from a single 
iUumination fiber. AH fibers have a core diameter of 200 microns. White light from 
the Xe lamp is coupled to the proximal end of the illumination fiber. The distal ends 
of Uie fibers are flush witii the probe tip and placed in direct contact witii the sample 
surface. Using tiiis system, oral cavity tissue reflectance spectra from 390-590 nm 
witii a spectral resolution of 4 nm were collected in approximately 30 seconds. TTie 
signal to noise ratio exceeded 75: 1 for 90% of die data. 

Procedure 

Reflectance spectra were wavelength calibrated witii a mercury light source. 
Dark current and background were recorded before each measurement witii tiie same 
settings but witii iUumination turned off. These background measurements were 
subffacted from each reflectance measurement offline. Reflectance data are reported 
relative to a 2.68% by volume solution of 1.072 micron diameter polystyrene 
microspheres (Polyscience Inc., Warrington, PA). The probe was placed on tiie 
outside waU of a 1 cm patii lengtii cuvette containing tiie microsphere solution. The 
total integrated reflectance of tiiis standard was measured on a double beam 
spectrophotometer (U-3300 Hitachi, Tokyo, Japan) witii an integrating sphere 
attachment (Labsphere Inc., Nortii Sutton, NH). This was used to correct tiie 
reflectance measurements of tiie microsphere solution made witii tiie spectroscopic 
system. Tissue spectra at each collection fiber position were divided pointwise by tiie 
conected standard reflectance spectrum at tiie corresponding fiber position. 

Before the probe was used it was disinfected witii Metricide (Metrex Research 
Corp.) in accordance witii standard protocol. The probe was tiien guided into tiie oral 
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cavity and its tip positioned flush with the mucosa Then reflectance spectra were 
measured. 

Reflectance spectra were measured from 9 volunteers with no history of oral 
cavity neoplasia at 35 clinically normal sites in the oral cavity (sec Table 3). No 
biopsies were obtained from volunteers. Following visual screening in II patients 
with a known or suspected premaUgnant or malignant oral cavity lesion, reflectance 
spectra were measured from 27 sites. The physician placed the fiber optic probe on a 
lesion or suspected lesion and the reflectance of that site was measured. In addition to 
the three to five visuaUy abnonnal sites, reflectance spectra were measured from one 
to three contralateral normal sites, Post-speciroscopy. abnonnal sites were tattooed 
witii India Ink where the probe measured the spectra. A clinical diagnosis of each 
lesion as normal, abnormal (not dysplastic). abnormal (dysplastic) or cancerous was 
recorded by an experienced head and neck surgeon (AMG) or dental oncologist (RJ). 
During follow up surgery, a 2-4 mm biopsy of Jhe.tissue. was taken from tiie tattooed 
area. These specimens were evaluated by an experienced pathologist (BK) using light 
microscopy and classified as normal, mucosal reactive atypia (MRA), dysplasia or 
cancer using standard diagnostic criterion. Biopsies witij multiple diagnoses were 
classified according to tiie most severe padiological diagnosis. The paUiologist and 
clinicians were blinded to tiic results of Uie spectroscopic analyses. 
Data Analysis • • • • 

Reflectance spectra were further processed to reduce noise. A moving average 
witfi a widUi of 10 nm was applied to each spectrum: following this, intensities of aU 
reflectance spectra were extracted in 5 nm steps from 400 to 585 nm and individually 
analyzed. In addition, tfie first (slope) and second derivatives of the reflectance spectra 
were calculated between 400 and 580 nm in 5 nm steps. 

An exploratory data analysis was carried out to determine which source- 
detector separations and wavelengti) regions were useful to separate three tissue 
categories: normal, dysplasia and cancer. The normal class contained sites which were 
clinically and/or histologically nomal as well as benign changes such as 
inflammation. 
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. For each diagnosuc category (normal, dysplasia, cancer) the invemore 
calculated the average value and standard deviation of the intensity at each 
wavelength, and the first and second derivative at each wavelength. These values were 
calculated separately for each source detector separation. The Student's i-iest was 
used to determine whether differences in these mean values were statistically 
significant between groups of two categories. The inventors examined noraial tissues 
vs. abnormal tissues (dysplasia and cancer) as well as normal tissues vs. dysplasia 

Parameters which were most staiisticaUy significant, corresponding to the 
lowest p-values, were examined fimhcr for diagnostic ability. The inventors 
constructed two-dimensional scatter plots which showed the most statistically 
significant parameter values for each site measured to determine which parameters 
could most effectively discriminate between the two categories of nomial and 
abnormal (dysplasia and cancer). All calculations and gr^hs were produced with the 
Matlab® (Mathworks Inc.) and the Statistical Toolbox for Matlab. 

Results 

Figures 38 through 40 show the reflectance spectra, first and second derivative 
at each of the three source detector separations for all sites measured. Figures 41 
through 43 show the average value plus and minus one standard deviation for normal, 
dysplastic and cancer sites. Normal sites arc shown in green, dysplasia in blue and 
cancer in red. In general, the spectra of cancer sites show the highest reflectance 
intensity at all wavelengths measured, while spectra of normal and dysplastic sites are 
lower in intensity and more similar. Differences in intensity are greatest at position 1 
and least at position 3. The slope and second derivative of the reflectance specua are 
greater (lower) for cancers at 440 and 480 nm (520 nm). 

Figure 44 shows the p values comparing the mean intensity, mean first and 
second derivatives of normal tissue versus abnormal tissues, at each wavelength at the 
three different source detector separations. Figure 45 shows the p values comparing 
the mean intensity, mean first and second derivatives of normal tissue versus 
dysplastic tissues, at each wavelength at the three different source detector 
separations. A low value indicates a statistically significant result; the inventors are 
particularly interested in those with values less than 0.05. 



W099/57S29 PCT/US99/W768 

84 .. 

At each source-detcaor fiber separation, the inventors ranlced the intensity, 
first and second derivatives at each wavelength in order of increasing p-value. Tables 
4-6 show the results when normal and abnormal tissues were compared. Tables 7-9 
show the results when normal and dysplasuc tissues were compared. Results are 
shown for p-values less than or equal to 0.05. 

In order to explore Uie diagnostic contributions provided by tiiese wavelength 
regions, tiie inventors highlighted all regions where tiie p-value was less tiian or equal 
to 0.01 for first and second derivatives and les$ than or equal to 0.02 for intensiQr. 
These values are highlighted in gray in tables 4-9. This resulted in a total of 15 
different parameters. The slope and second derivative near 440-460 nm at positions 1 
and 2 were identified as diagnostically useful regions, as was tiie slope and second 
derivative near 500-510 nm at position 3. The intensity from 450-51 nm and 570-585 
nm at position 2 were also identified as diagnostically useful. 

Two dimensional scatterplots containing all possible^ pairwise combinations of 
tiiesc 15 groups of parameters were generated (105 total, combinations). Figures 46- 
48 show tiiree representative examples. Figure 46 shows the second derivative at 430 
nm for position 2 vs. Ae second derivative at 495 nm for position one. The straight 
line represents an algoridmi to separate normal findings from dysplasias and cancers, 
and results In a sensitivity of 80% and a specificity of 85%. Figure 47 shows tiie 
second derivative at 450 nm for position 1 vs.- tiie first derivative at 510 nm for 
position tiiree. The straight line represents an algorithm: to separate normal findings 
from dysplasias and cancers, and results in a sensitivity of 80% and a specificity of 
82%. Figure 48 shows the second derivative at. .410 nm for position 1 vs. the first 
derivative at 510 nm for position tiiree. The straight line represents an algoritiun to 
separate normal findings from dysplasias and cancere, and results in a sensitivity of 
70% and a specificity of 75%. In each case, tite lines were drawn to minimize the 
total number of samples misclassified. These sensitivity and specificity values arc 
sligbdy lower dian tiiosc achieved in tiie previous section using fluorescence alone, 
and reflect Uie greatw overly in tiie reflectance of tissues from tiie tiiree groups tiian 
is seen in tiie fluorescence spectra. However, the fluoresceiice algoritiuns were based 
on multi-variate classifien to enable tiie use of more tiian two parameters in the 
algorithm. These techniques were next pursued using reflectance spectra. 
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Multi-Variate Discriminant Algorithms 

Renectance spectra were analyzed to determine which wavelength ranges and 
souicc-detector fiber separations contained the most diagnostically useful infonnation 
and to estimate the peifonnance of multi-variate diagnostic algorithms based on this 
information. The inventors considered algorithms based on multi-variate discriminant 
analysis. First, the inventois developed algorithms based on reflectance spectra, or 
their first or second derivatives over various wavelength ranges at each source- 
detector fiber separation in order to detennine which types of spectra, wavelength 
ranges and fiber separaUons contained the most diagnostic information. In addition, 
the inventois developed algorithms using the concatenated spectra (or their fust or 
second derivatives) at aU fiber separations over various wavelength ranges. In each 
case, the algorithm development process, described in detaU below, consisted of the 
following major steps: (1) data reduction to reduce the dimensionality of the data set, 

(2) feature selection and classification to develop algorithms which maximized 
diagnostic performance and minimized the likelihood of over-training in a training set, 

(3) unbiased evaluation of tiiese algorithms using tiie technique of cross-validation. 
Diagnostic Categories 

Multi-variate discriminant algorithms were sought to separate two tissue 
categories: normal and abnormal. The abnormal class contained sites with dysplasia, 
carcinoma in situ and squamous cell carcinoma; the normal class contained sites 
which were clinically and/or histologicaUy nonnal as well as benign changes such as 
inflammation. 

Algorithm Development 

For each of the different types of spectra and wavelengUi ranges, spectra from 
the entire data set were used as a UTiining set to develop multi-variate algorithms to 
separate normal and abnormal tissues based on their reflectance. Algoritiun 
development included two steps: (1) data reduction and (2) development of a 
classification algorithm which maximized diagnostic perfonnance. For each type of 
data, principal component analysis was perfoimed using tiie entire dataset and 
eigenvectors accounting for 65. 75. 85. 95% and 99% of Uie total variance were 
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retained. Principal component scores associated with these eigenvectors were 
calculated for each sample. Discriminant functions were then formed to classify each 
sample as normal or abnormal. The classification was based on the Mahalanobis 
distance, which is a multivariate measure of the separation of a point from a dataset in 
5 n-dimensional space. Each sample was held out one at a time and the Mahalanobis 
distances between to the held out sample and the remaining normal and abnormal 
samples were calculated; the sample was classified according to the category 
corresponding to the smallest distance. The sensitivity and specificity of the 
algorithm were then evaluated relative to diagnoses based on histopathology (in 

10 patients suspected to have oral cavity malignancy) or clinical impression (in normal 
volunteers). Overall diagnostic performance was evaluated as the sum of the 
sensitivity and the specificity, thus minimizing the number of misclassifications 
(when prevalence of disease and normal are approximately equal). The performance 
of the diagnostic algorithm depended on the principal component scores which were 

15 included. Five different diagnostic algorithms . were developed using principal 
component scores derived from eigenveaors accounting for increasing amounts of 
total variance. From the available pool of principle component scores, the single 
principal component score yielding the best initial performance was identified, and 
then the principal component score that most improved this performance was selected. 

20 This process was repeated until performance was no longer improved by the addition 
of principal components scores, or all available scores were selected. The pool of 
available eigenvectors is specified by a variance criterion, eigenvector significance 
level (ESL), that represents the minimum variance fraction accounted for by the sum 
of the n largest eigenvalues. In this work the inventors examined 5 ESLs, 

25 concsponding to 65%. 75%, 85%. 95% and 99% of the total variance. 

Comparing Performance of Various Data Types and Wavelength Ranges 

At each ESL, wavelength range and type of data the inventors calculated the 
sum of sensitivity and specificity. As the ESL approaches 100%, over-training 
becomes more likely, since the available pool of eigenvectors will account for nearly 
30 100% of the variance, including variance due to noise. The magnitude of 
diagnostically imponant variances is unknown. The risk of over-training risk was 
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a$sesM.for each of the types of input data, by coriiparing the training set performance 
to the performance of an algorithm developed from the same data after the diagnoses 
coiresponding to each measurement site had been randomized. This provides a 
dataset with the same variance stnicnire as the original dataset, but where the 
diagnostic performance is not expected to exceed that of chance. In order to make 
equivalent con^iarisons. the disease prevalence in the real sample was maintained in 
the randomly assigned diagnoses. Diagnostic algorithms were then developed again 
which minimized the number of misclassified,. samples at a specified eigenvector 
significance level (ESL). Random diagnoses were assigned fifty times for each 
wavelength combination and the average and standard deviation of the sum of the 
sensitivity and specificity were calculated. Ideally, for completely normaUy 
distributed data, the sum of the sensitivity and specificity should be one for the 
randomized diagnosis at all levels of training significance. However, if over-training 
occurs, this sum will be greater than one. At each.ESL, wavelength range and type of 
dau the inventors calculated the absolute difference between the training set 
performance and random diagnosis assignment This niethod allows the best types of 
data and wavelength ranges to be identified based on their robustness, or lack of 
propensity to over-train. Unlike our. analysis of the fluorescence from oral cavity, in 
this case, all sensitivity and specificity values were calculated for the case of cross- 
validation. This proved to be necessary since, the eigenvectors which contained 
diagnostically useful information contributed a relatively smaller amount of Uie total 
variance for reflectance tiian for fluorscence. The largest differences, were selected as 
the optimal data type and wavelengtii range. This criterion selects tiie data type and 
wavelength range that is least prone to over-training.. 

Residts • Midti-Variate Discriminant Algorithms 

Tables 10-12 show the absolute difference between the training set 
performance and random diagnosis assignment for the different data types, 
wavelength ranges and ESLs. The inventors selected an improvement of 0.5 as 
significant for first and second derivative data and an improvement of greater dian 0.4 
as significant for intensity data (since this is easier to measure in a multi-spectral 
imaging system). Wavelength ranges, data types and ESLs with at least this 
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improvement are higUigbted in Tables 10-12. Eight types of data met these criteria; 
however, the wavelength range associated with several of them overlapped 
significandy. In Uiis case, the combination with the best perfonnance increase was 
selected, resulUng in the following four combinations: (1) Intensity at position 2 from 
395-475 nm at 95% ESL, (2) Intensity at positions 1-3 from 425-500 nm at 99% ESL, 
(3) Slope at position 1 from 450-525 nm at 65% ESL and (4) Slope at position 3 from 
395-550 nm at 95% ESL. Table 13 gives the cross-validated sensiUvity and 
specificity for algorithnos based on these data types, wavelength ranges and ESLs. 
The best performance was achieved using the slope at position 3 from 395-550 nm at 
95% ESL, with a cross-validated sensitivity of 70% and a specificity of 100%. This 
compares favorably to the scatter plot shown in Figure 47. which shows the second 
derivative at 450 nm for position 1 vs. the slope at 510 nm for position three, where a 
simple Unear discriminant algorithm resulted in i sensitivity of 80% and a specificity 
of82%. 

Head and Ned: Analysis- Combination of Fluorescence and Reflectance 

In general, the performance of multi-variate algorithms based on reflectance 
spectroscopy alone was somewhat lower than that based on huorescence spectroscopy 
alone. However, from an instrumentation point of view, it may be easier to measure 
reflectance images and spectra since signal to noise ratio is higher. Therefore, the 
inventors explored die combination of reflectance and fluorescence spectroscopy and 
wheter it may provide better discrimination. Further, the inventors examined whether 
the good performance of the fluorescence algorithm may be maintained if the number 
of fluorescence excitation wavelengths were reduced, but reflectance spectra were 
measured. 

In our previous analyses, the inventors identified a combination of emission 
spectra at three excitation wavclengtii as optimal for diagnosis based on fluorescence 
spectroscopy and four types of reflectance dau which were optimal for diagnosis. The 
inventors evaluated the performance of the following combinations of data at ESLs of 
65%. 75%. 85%, 95% and 99%: (a) Fluorescence at three excitation wavelengdis + 
each type of reflectance data, (b) Huorescence at all combinations of two excitation 
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wavelengths + each type of reflectance data, and (c) Fluorescence at each single 
excitation wavelength + each type of rcnectance data. 

The perfonnance of these combinations was compared to that which could be 
achieved with fluorescence alone. Since the number of samples where both 
fluorescence and reflectance data were available was smaUer than that for cither type 
of data alone, the inventors re-evaluated the perfonnance of algorithms based on 
reflectance or fluorescence data alone using this reduced dataset The inventors also 
evahiated the perfonnance of fluorescence alone at one or two excitation wavelengths 
using this reduced dataset Table Mshowsthenumberof patients and sites where 
both reflectance and fluorescence data were available. Results, reported as sensitivity 
and specificity giving best perfonnance under cross vaUdauon. are shown in Tables 
15-18 for each type of reflectance data. 

The perfonnance of the fluorescence algorithm based on three excitation 
wavelengths does not improve when any of the four types of reflectance daui are also 
incorporated. The perfonnance of fluorescence algorithms based on two excitation 
wavelengths was lower than that for three excitation wavelengths; incorporation of 
any of the four types of reflectance spectia,.did not improve perfonnance. The 
perfonnance of fluorescence algoriduns based on a single exciution wavelengUi was 
lower Uian that for two and three excitation wavelengths. Best results were obtained 
using spectra at 400 nm excitation, tocorporation of any of tiie four types of 
reflectance spectra did not improve performance. 

All of Uie methods and apparams disclosed and claimed herein can be made 
and executed without undue experimentation in light of die present disclosure. While 
Uie apparatus and methods of this invention have been described in tenns of certain 
embodiments, it will be apparent to those of skill in die an that variations may be 
appUed to the methods and/or apparatus described herein wiUiout departing from the 
concq)t, spirit and scope of tiie invention. 
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1. An apparatus for performing fluorescence and spatially resolved reflectance 
spectroscopy on a san^le, comprising: 
a light source; 

a monochromator in optical communication with said light source; 
a reflectance illumination fiber in q)tical communication with said light 
source; 

a fluorescence excitation fiber in optical communication with said 

monochromator; 
an imaging spectrograph; 

a fluorescence collection fiber in optical communicauon with said imaging 
spectrograph; 

a reflectance collection fiber in optical communication with said imaging 

spectrograph and in spaced relation with said reflectance illumination 
fiber; and 

and a detector in optical communication with said imaging spectrograph. 

2. The apparatus of claim 1, wherein said light source, comprises a Xe arc lamp. 

3. The apparaois of claim 1, wherein said monochromator comprises a double 
monochromator. 

4. The apparanis of claim 1, wherein said detector comprises a tiiermo-electrically 
cooled CCD camera. 

5. The apparatus of claim 1, wherein said fluorescence excitation fiber and said 
fluorescence collection fiber are integral. 

6. The apparatus of claim 1, wherein one or more of said fibers are positioned flush 
with said sample. 
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7. The .apparatus of claim 1. further comprising a spacer positioned between one or 
more of said fibers and said sample. 

8. The apparatus of claim 1. wherein said reflectance illumination fiber, said 
fluorescence excitation fiber, said fluorescence collection fiber, and said reflectance 
collection fiber define a fiber optic probe. 

9. The apparatus of claim 8, wherein said probe is configured to be positioned within 
a trocar. 

10. The apparanis of claim 8, wherein said probe comprises a center section and an 
outer section, said fluorescence excitation fiber and said fluorescence coUection fiber 
being positioned in said center section, and said reflectance illumination fiber and said 
reflectance collection fiber being positioned in said outer section. 

11. The apparatus of claim 1. comprising a plundity of fluorescence excitation and 
collection fibers arranged in a circular bundle. . . ;.v. 

12. The apparatus of claim 1. comprising a plurality of reflectance coUecuon fibers 
defining a plurality of collection positions. 

13. The apparatus of claim 12, wherein said plurality of collection positions are 
spaced between about 0 and about 10 millimeters from said reflectance Ulumination 
fiber. 

14. TTie apparatus of claim 1, wherein said reflectance collection fiber defines a 
coUection position at about 180 degrees relative to said rcfleoance illumination fiber. 

15. The apparatus of claim 1, wherein said reflectance collection fiber defmes a 
coUection position at about 90 degrees relative to said reflectance iUumination fiber. 
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16. The apparatus of claim 1, wherein said reflectance collection fiber defines a 
collection position at about 45 degrees relative to said reflectance illumination fiber. 

17. The apparatus of claim 1, further comprising a one or more fibers in optical 

5 communication with said light source and configured to illuminate said sample during 
operation of said apparatus. 

18. The apparatus of claim 1, conq)rising a plurality of fluorescence excitation fibers 
arranged in one or more rows adjacent said monochromator. 

10 

19. The apparams of claim 1, comprising a plurality of fluorescence excitation fibers 
and a plurality of reflectance collection fibers arranged in a single row adjacent said 

. imaging spectrograph. 

15 20. The apparatus of claim 19, further comprising pne or more unconnected fibers 
interspersed with said plurality of fluorescence excitation fibers and said plurality of 
reflectance collection fibers. 

21. The apparatus of claim 1, further comprising a fiber connected fix)m said light 
20 source to said imaging spectrograph to monitor spectral output of said light source. 

22. The apparatus of claim 1, further comprising, a controller coupled to said detector. 

23. An apparatus for measuring fluorescence and spatially resolved reflectance 
25 spectra of a sample, comprising: 

a light source; 

a monochromator in optical communication with said light source; 

a fiber optic probe in optical communication with said light source and with 

said monochromator, said probe comprising a plurality of fluorescence 
30 excitation and collection fibers in spaced relation and a plurality of 

reflectance collection fibers in spaced relation with a reflectance 

illumination fiber; 
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an imaging spectrograph in optical communication with said plurality of 
fluorescence collection fibers and with said plurality of reflectance 
collection fibers; and 

a detector in optical communication with said imaging spectrograph. 

5 

24. The apparatus of claim 23, wherein said plurality of reflectance collection fibers 
and said reflectance illumination fiber are positioned concentrically about said 
plurality of fluorescence excitation and collection fibers. , 

10 25. The apparatus of claim 23, wherein at least one of said plurality of reflectance 
collection fibers defines a collection position at about 180 degrees relative to said 
reflectance illumination fiber. 



26. The apparatus of claim 23, wherein at least one. of said plurality of reflectance 
15 collection fibers defines a collection posidon at about 90 degrees relative to said 

reflectance illumination fiber. 

27. The apparatus of claim 23, wherein at least one of said plurality of reflectance 
collection fibers defines a collection position at al>put 43 degrees relative to said 

20 reflectance illumination fiber. 

28. The apparatus of claim 23, wherein said plurality of collection positions are 
spaced between about 0 and about 10 millimeters from said reflectance illumination 
flber. 

25 

29. The apparatus of claim 23, wherein said probe comprises between twenty-one and 
forty-six optical fibers. 

30. A method for combined fluorescence and spatially resolved reflectance 
30 spectroscopy of a sample, comprising: 

directing radiation to said sample with a fluorescence exciuuon fiber, 
collecting radiation from said sample with a fluorescence collection fiber. 
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directing said radiation from said sample to an imaging spectrograph and a 
detector, 

illuminating said sample with a reflectance illumination fiber, 

collecting reflected light from said sample with a reflectance collection fiber in 

spaced relation with said reflectance illumination fiber, and 
directing said reflected light from said sample to an imaging spectrograph and 

a detector. 



31. The method of claim 30, wherein said coUepting reflected light con^jrises 
collecting reflected light from a plurality of collection positions with a plurality of 
reflectance collection flbers. 



32. The method of claim 30. wherein said coUecting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a collection position at about 180 degrees relative to said reflectance illumination 
fiber. 

33. The method of claim 30. wherein said collecting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a coUeaion position at about 90 degrees relative to said reflectance illumination fiber. 

34. The method of claim 30, wherein said collecting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a collection position at about 45 degrees relative to said reflectance illumination fiber. 

35. The method of claim 30, wherein said sample comprises ovarian, head and neck, 
or cervical tissue. 



36. The method of claim 30, fimher comprising analyzing spectral data from said 
detector to characterize said sample. 
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37. The method of claim 36. wherein said analyzing comprises pre-processing said 
data and reducing a dimension of said data using principal component analysis. 

38. The method of claim 37, wherein said analyzing further comprises selecting one 
or more diagnostic principal components of said data and forming one or more 
algorithms. 

39. The method of claim 38, wherein said analyzing further comprises fonnmg one or 
more composite algorithms. 

40. The method of claim 38, wherein said analyzing further comprises evaluating at 
least on of said algorithms using a cross-validation technique. 

4 1 . A method for combined fluorescence and spatially resolved reflectance 
spectroscopy of a sample, comprising: . ; . 

directing radiation to said sample with a fluorescence excitation fiber, 
collecting radiation from said sample with a fluorescence collection fiber, 
directing said radiation from said sample to an imaging spectrograph and a 
detector 

iUuminating said sample with a reflectance illumination fiber, 

collecting reflected light at a plurality of collection positions from said sample 

witii a plurality of reflectance collection fibers arranged in spaced 

relation; 

directing said reflected light from said sample to an imaging spectrograph and 

a detector to produce spectral data; . . 
pre-processing said data; and 

reducing a dimension of said data using principal component analysis. 

42. The method of claim 41, further comprising selecting one or more diagnostic 
principal components of said data and forming one or more algorithms. 
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43. The oiethod of claim 42. further comprising forming one or more composite 
algorithms. 

44. The method of claim 43, further comprising evaluating at least one of said 
algorithms using a cross-validadon technique. 

45. A method for analyzing spectroscopy data to define an optimized reduced data 
set, comprising: 

pre-processing said spectroscopy data; 

reducing a dimension of said spectroscopy data using principal component 
analysis; and 

selecting one or more diagnostic principal components of said spectroscopy 
data. 

46. The method of claim 45. wherein said spectroscopy data comprises combined 
fluorescence and spatially resolved reflectance spectroscopy data. 

47. The method of claim 45. wherein said pre-processing comprises normalization of 
said spectroscopy data. 

48. The method of claim 45, wherein said pre-processing comprises mean scaling 
said spectroscopy data. 

49. The method of claim 45. wherein said pre-processing comprises calculating one 
or more derivatives on said spectroscopy data. 

50. The method of claim 45. further comprising eliminating redundant data from said 
spectroscopy data. 

5 1 . The method of claim 45. further comprising forming one or more algorithms and 
evaluating at least one of said algorithms using a cross validation technique. 
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32. The jnethod of claim 5 1 . further comprising forming one or more composite 
algorithms. 
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